Workflow Ecosystem Tools Publications About
Quantitative Proteomics Platform

From Raw Spectra to Biological Insight

An open-source ecosystem for large-scale quantitative proteomics. Cloud-ready Nextflow pipelines, standalone Python libraries, and ML-powered tools — process DDA and DIA data with dedicated workflows, or use the standalone tools with any search engine.

Raw Data .mzML .raw .d Identification rescoring Quantification mokume Quality Control pmultiqc
ecosystem tools
# DDA proteomics pipeline (LFQ & TMT/iTRAQ)
nextflow run bigbio/quantms \
    -profile docker \
    --input samplesheet.csv \
    --database uniprot.fasta \
    --outdir results/

# Runs on laptops, HPC, or AWS/GCP/Azure
# DDA-LFQ, DDA-ISO (TMT/iTRAQ) workflows
# DIA proteomics pipeline (powered by DIA-NN)
nextflow run bigbio/quantmsdiann \
    -profile docker \
    --input samplesheet.csv \
    --database uniprot.fasta \
    --outdir results/

# Dedicated DIA workflow with DIA-NN engine
# Library-free or spectral library modes
# Convert search engine outputs to unified format
qpx convert \
    --tool diann \
    --report diann_report.tsv \
    --sdrf experiment.sdrf.tsv

# Supported: DIA-NN, MaxQuant, FragPipe,
#   Spectronaut, quantms, Sage, MSFragger
# Export to Parquet or AnnData (.h5ad)
# Protein quantification from any search engine
# Works with quantms, quantmsdiann, MaxQuant, FragPipe
mokume features2proteins \
    -p features.parquet \
    -o proteins.csv \
    -s experiment.sdrf.tsv \
    --quant-method maxlfq

# TMT with IRS + differential expression
mokume features2proteins \
    -p features.parquet \
    --quant-method median \
    --irs --de
# Interactive QC reports for any experiment
multiqc results/ \
    --module pmultiqc

# Generate mokume QC + comparison reports
mokume features2proteins \
    -p features.parquet \
    -s experiment.sdrf.tsv \
    --report qc_report.html

# HTML reports: PCA, heatmaps, CVs, volcanos
Scroll to explore

The Pipeline

A Nextflow-orchestrated workflow where specialized tools handle each step.

End-to-End Quantitative Proteomics

Two dedicated Nextflow pipelines: quantms for DDA (LFQ & TMT/iTRAQ) and quantmsdiann for DIA — but the downstream tools work independently too. Use mokume to quantify proteins from any search engine output. Use pmultiqc for QC on any experiment.

  • 1Convert & search raw spectra against protein databases
  • 2Rescore PSMs with ML models for higher sensitivity
  • 3Quantify proteins using iBAQ, MaxLFQ, TopN, or ratio methods
  • 4Generate QC reports and differential expression results
Read pipeline docs →

Raw Data Acquisition

.mzML · .raw · .d
quantms

Identification & Rescoring

MS2PIP · DeepLC · Percolator
rescoring

Quantification

iBAQ · MaxLFQ · TopN · Ratio
mokumeqpx

Quality Control & DE

Interactive reports · Volcano plots
pmultiqcmokume

Data & Exploration

Browse reanalyzed proteomics datasets and explore results interactively.

log2 Intensity 30 25 20 15 Liver Brain Heart Kidney Plasma Spleen Samples

Explore Reanalyzed Datasets

The quantms.org data portal hosts thousands of public proteomics datasets reanalyzed from ProteomeXchange repositories (PRIDE, MassIVE, jPOST). Browse absolute and differential expression across tissues, diseases, and organisms. All ecosystem tools also work locally for your own data analysis.

0
Datasets
0
Collections
0
Organisms
0
Proteins
Visit data portal →

AI-Powered Proteomics

Deep learning models trained on 500M+ spectra, powering the next generation of proteomics analysis.

From Data to Models

The quantms ecosystem reanalyzes thousands of public proteomics experiments, producing standardized datasets. These datasets train deep learning models that improve every step of the pipeline — from spectrum prediction to de novo sequencing.

500M+
Training Spectra
6
Released Models
4
AI Tools
Browse Models FTP Repository
Model Pipeline
Public Datasets
50,000+ experiments from ProteomeXchange
quantms Reanalysis
Standardized processing with quantms & quantmsdiann
AI Model Training
AlphaPeptDeep, AutoRT, GPTime, HelixNovo
Better Proteomics
10-30% more identifications, de novo sequencing, RT prediction

Standards & Formats

Built on open formats for reproducible, metadata-rich proteomics.

Reproducibility by Design

Every experiment in the ecosystem is described with SDRF (Sample and Data Relationship Format), ensuring complete metadata annotation. The qpx library bridges the gap between search engines — converting outputs from quantms, quantmsdiann, DIA-NN, FragPipe, MaxQuant, and Spectronaut into a unified format.

Convert outputs from any search engine into a unified Parquet format. Export to AnnData for integration with scverse and other analysis ecosystems.

qpx documentation →

SDRF — Sample Metadata

Standardized experimental design: organisms, tissues, labels, conditions. Used by all quantms tools for automatic sample grouping.

qpx — Format Conversion

Convert from DIA-NN, FragPipe, MaxQuant, Spectronaut, quantms. Unified Parquet output with PSM-level data.

AnnData — Interoperability

Export protein matrices to .h5ad for integration with scanpy, scverse, and the broader Python data science ecosystem.

Quantification Library

mokume — the protein quantification engine at the heart of quantms.

mokume

A comprehensive Python library for protein quantification from peptide-level data. Works standalone with output from any search engine — not just quantms. Supports 6+ quantification methods, normalization, IRS for multi-plex TMT, batch correction, and differential expression.

iBAQ MaxLFQ TopN DirectLFQ Ratio (TMT) IRS ComBat DE Analysis
$ pip install mokume
Documentation → GitHub →
mokume
# LFQ quantification
mokume features2proteins \
    -p features.parquet \
    -o proteins.csv \
    -s experiment.sdrf.tsv \
    --quant-method maxlfq

# TMT + IRS + DE analysis
mokume features2proteins \
    -p features.parquet \
    --quant-method median \
    --irs --de \
    --de-contrasts "NASH-HL"
from mokume.pipeline import (
    QuantificationPipeline, PipelineConfig
)
from mokume.pipeline.config import (
    InputConfig, QuantificationConfig
)

config = PipelineConfig(
    input=InputConfig(
        parquet="features.parquet",
        sdrf="experiment.sdrf.tsv",
    ),
    quantification=QuantificationConfig(
        method="maxlfq",
    ),
)
pipeline = QuantificationPipeline(config)
proteins = pipeline.run()
// Run quantms pipeline
nextflow run bigbio/quantms \
    -profile docker \
    --input samplesheet.csv \
    --database uniprot.fasta \
    --outdir results/

// Cloud-ready: AWS, GCP, Azure
// Nature Methods 2024

Visualization & Reports

Publication-ready QC reports and interactive visualizations.

From QC to Volcano Plots

pmultiqc generates interactive HTML reports with comprehensive QC metrics for every experiment. mokume adds differential expression reports, interactive plots, and workflow comparison dashboards.

  • Sample correlation heatmaps & PCA
  • CV distributions & missing value analysis
  • Volcano plots & differential expression tables
  • Workflow comparison across methods
  • Interactive HTML — no extra tools needed
mokume reports → pmultiqc →
log2 Fold Change −log10 p-value Down Up

The Ecosystem

Eight specialized tools, one unified platform.

quantms

Nextflow Pipeline

DDA proteomics: LFQ and TMT/iTRAQ workflows. Nature Methods 2024.

quantmsdiann

Nextflow Pipeline

DIA proteomics powered by DIA-NN. Library-free and spectral library modes.

mokume

Python Library

Protein quantification, normalization, batch correction, DE. Successor to ibaqpy.

qpx

Python CLI

Format conversion: DIA-NN, FragPipe, MaxQuant. AnnData export.

pmultiqc

MultiQC Plugin

Interactive HTML QC reports for LFQ, TMT, and DIA.

quantms-rescoring

ML Tool

PSM rescoring with MS2PIP, DeepLC, AlphaPeptDeep.

sdrf-pipelines

Python Library

SDRF validation, annotation, and conversion for proteomics metadata.

quantms.org

Web Portal

Browse reanalyzed proteomics datasets interactively.

Why quantms

Standards-First

SDRF metadata, open file formats, and reproducible workflows ensure every experiment is fully documented and shareable.

Large-Scale & Cloud-Ready

Process 50,000+ datasets at cloud scale. Nextflow orchestration with Docker/Singularity runs on laptops, HPC clusters, or AWS/GCP/Azure. Python libraries handle local analysis just as well.

Works With Your Tools

Not locked in. Use mokume and pmultiqc with data from quantms, quantmsdiann, MaxQuant, FragPipe, Spectronaut, or DIA-NN. Export to AnnData for the scverse ecosystem.

Publication-Ready

Interactive QC reports, volcano plots, PCA, heatmaps, and differential expression tables — ready for your next manuscript.

Open Science, Open Community