nfcore/diaproteomics is a bioinformatics analysis pipeline used for quantitative processing of data independant (DIA) proteomics data.
The workflow is based on the OpenSwathWorkflow for SWATH-MS proteomic data. DIA RAW files (mzML) serve as inputs and library search is performed based on a given input spectral library. If specified internal retention time standarts (irts) will be used to align library and DIA measurements into the same retention time space. FDR rescoring is applied using Pyprophet based on a competitive target-decoy approach on peakgroup or global peptide and protein level. Finally, exctracted chromatograms are realigned and quantified using the TRIC algorithm.
nextflow run nf-core/diaproteomics --dia_mzmls '*.mzML' --spectral_lib '*.pqp' --irts '*.pqp' --swath_windows '*.txt' -profile test,docker
See usage docs for all of the available options when running the pipeline.
The nf-core/diaproteomics pipeline comes with documentation about the pipeline, found in the
- Pipeline configuration
- Running the pipeline
- Output and how to interpret the results
nf-core/diaproteomics was originally written by Leon Bichmann.
If you would like to contribute to this pipeline, please see the contributing guidelines.
You can cite the
nf-core pre-print as follows:
Ewels PA, Peltzer A, Fillinger S, Alneberg JA, Patel H, Wilm A, Garcia MU, Di Tommaso P, Nahnsen S. nf-core: Community curated bioinformatics pipelines. bioRxiv. 2019. p. 610741. doi: 10.1101/610741.