Amplicon sequencing analysis workflow using DADA2 and QIIME2
nf-core/ampliseq is a bioinformatics analysis pipeline for 16S rRNA amplicon sequencing data.
The workflow runs quality control on the raw FastQ inputs (FastQC), trims primer sequences from the reads (Cutadapt), imports the data into QIIME2, and infers amplicon sequence variants (ASVs) with DADA2. It then classifies features against the SILVA v132 database, excludes unwanted taxa, and produces absolute and relative feature/taxa count tables and plots. Finally, it plots alpha rarefaction curves, computes and plots alpha and beta diversity indices, and calls differentially abundant taxa (ANCOM). See the output documentation for more details of the results.
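To make the terms "relative count table", "alpha diversity" and "beta diversity" concrete, here is a minimal Python sketch. It is not the pipeline's implementation: the pipeline delegates these steps to QIIME2. The sample names, ASV identifiers and counts are hypothetical. The choice of the Shannon index (with natural logarithm) and Bray-Curtis dissimilarity as example metrics is also an assumption made for illustration.

```python
import math


def relative_abundance(counts):
    """Convert one sample's absolute feature counts into fractions that sum to 1."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sample has no reads")
    return {feature: n / total for feature, n in counts.items()}


def shannon_index(counts):
    """Alpha diversity: Shannon entropy (natural log) of one sample."""
    rel = relative_abundance(counts)
    return -sum(p * math.log(p) for p in rel.values() if p > 0)


def bray_curtis(a, b):
    """Beta diversity: Bray-Curtis dissimilarity between two samples."""
    features = set(a) | set(b)
    # Sum of the smaller count of each feature across the two samples.
    shared = sum(min(a.get(f, 0), b.get(f, 0)) for f in features)
    total = sum(a.values()) + sum(b.values())
    if total == 0:
        raise ValueError("both samples are empty")
    return 1 - 2 * shared / total


# Hypothetical absolute ASV counts for two samples.
sample_a = {"ASV_1": 50, "ASV_2": 30, "ASV_3": 20}
sample_b = {"ASV_1": 10, "ASV_3": 40, "ASV_4": 50}

print(relative_abundance(sample_a))
print(shannon_index(sample_a))
print(bray_curtis(sample_a, sample_b))
```

Relative abundances make samples with different sequencing depths comparable. Alpha diversity summarises a single sample, and beta diversity compares a pair of samples.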
The pipeline is built using Nextflow, a workflow tool that runs tasks across multiple compute infrastructures in a very portable manner. It comes with Docker / Singularity containers, making installation trivial and results highly reproducible.
The nf-core/ampliseq pipeline comes with documentation covering:
- Pipeline configuration
- Running the pipeline
- Output and how to interpret the results
These scripts were originally written for use at the Quantitative Biology Center (QBiC) and Microbial Ecology, Center for Applied Geosciences, part of Eberhard Karls Universität Tübingen (Germany) by Daniel Straub (@d4straub) and Alexander Peltzer (@apeltzer).