MPRAlib is a Python library and CLI for processing MPRA (Massively Parallel Reporter Assay) data.
If you use MPRAlib in your work, please cite our recent preprint:
Uniform processing and analysis of IGVF massively parallel reporter assay data with MPRAsnakeflow
Jonathan D. Rosen, Arjun Devadas Vasanthakumari, Kilian Salomon, Nikola de Lange, Pyaree Mohan Dash, Pia Keukeleire, Ali Hassan, Alejandro Barrera, Martin Kircher, Michael I. Love, Max Schubach
bioRxiv (2025). 2025.09.25.678548
pip install mpralib
From the bioconda channel:
conda install -c bioconda mpralib
Use the mpralib command to access various functionalities.
MPRAlib provides a CLI tool for validating MPRA data files against supported schemas.
mpralib validate-file <schema> --input <input_file>
<schema>: One of reporter-sequence-design, reporter-barcode-to-element-mapping, reporter-experiment-barcode, reporter-experiment, reporter-element, reporter-variant, reporter-genomic-element, reporter-genomic-variant
<input_file>: Path to your data file (e.g., .tsv.gz, .bed.gz)
Example:
mpralib validate-file reporter-sequence-design --input data/reporter_sequence_design.example.tsv.gz
In general, MPRAlib is meant to be used as a library. Please have a look at our notebook mpralib.ipynb for a more detailed example.
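The validate-file command shown above can also be driven from a Python script, for instance to check several files in one run. The following is a minimal sketch that only wraps the documented CLI call with the standard library. It does not use the MPRAlib Python API, which is covered in the notebook. The helper names build_validate_command and validate_file are hypothetical, and treating a zero exit code as "file is valid" is an assumption about how the CLI reports its result.

```python
import subprocess

# Schema names copied from the validate-file description above.
SCHEMAS = (
    "reporter-sequence-design",
    "reporter-barcode-to-element-mapping",
    "reporter-experiment-barcode",
    "reporter-experiment",
    "reporter-element",
    "reporter-variant",
    "reporter-genomic-element",
    "reporter-genomic-variant",
)


def build_validate_command(schema, input_file):
    """Return the argument list for
    `mpralib validate-file <schema> --input <input_file>` (hypothetical helper)."""
    if schema not in SCHEMAS:
        raise ValueError(f"unknown schema: {schema!r}")
    return ["mpralib", "validate-file", schema, "--input", str(input_file)]


def validate_file(schema, input_file):
    """Run the CLI and return (passed, combined output) (hypothetical helper).

    Assumption: the CLI exits with code 0 when the file passes validation.
    """
    result = subprocess.run(
        build_validate_command(schema, input_file),
        capture_output=True,
        text=True,
    )
    return result.returncode == 0, result.stdout + result.stderr


if __name__ == "__main__":
    # Requires mpralib to be installed and available on PATH.
    passed, output = validate_file(
        "reporter-sequence-design",
        "data/reporter_sequence_design.example.tsv.gz",
    )
    print("passed" if passed else "failed")
    print(output)
```

Checking the schema name before launching the process catches typos early, without depending on how the CLI itself reports an unknown schema.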
MIT License