nf-core/differentialabundance
Differential abundance analysis for feature/ observation matrices from platforms such as RNA-seq
1.0.1
). The latest
stable release is
1.5.0
.
Introduction
This document describes the output produced by the pipeline. The directories listed below will be created in the results directory after the pipeline has finished. All paths are relative to the top-level results directory.
Report
This directory contains the main reporting output of the workflow.
Output files
report/
*.html
: an HTML report file named according to the value ofparams.study_name
, containing graphical and tabular summary results for the workflow run.
Plots
Stand-alone graphical outputs are placed in this directory. They may be useful in external reporting, publication preparation etc.
Output files
plots/
qc/
: Directory containing quality control plots from initial processing e.g. DESeq2*.png
exploratory/
: Directory containing standalone plots from exploratory analysis. Plots are stored in directories named for the main coloring variable used.[coloring variable]/png/boxplot.png
: Boxplot visualisation of abundance distributions[coloring variable]/png/density.png
: Density visualisation of abundance distributions[coloring variable]/png/pca2d.png
: 2-dimensional PCA plot[coloring variable]/png/pca3d.png
: 3-dimensional PCA plot[coloring variable]/png/sample_dendrogram.png
: A sample clustering dendrogram[coloring variable]/png/mad_correlation.png
: Outlier prediction plots using median absolute deviation (MAD)
differential/
: Directory containing standalone plots from differential analysis. Plots are stored in directories named for the associated contrast.[contrast]/png/volcano.png
: Volcano plots of -log(10) p value agains log(2) fold changes
gsea/
: Directory containing graphical outputs from GSEA (where enabled). Plots are stored in directories named for the associated contrast.[contrast]/png/[gsea_plot_type].png
Most plots are included in the HTML report (see above), but are also included in static files in this folder to facilitate use in external reporting.
Tables
Output files
tables/
processed_counts/
: Directory containing processed counts from initial processing from e.g. DESeq2[contrast_name].normalised_counts.tsv
: Normalised counts table[contrast_name].vst.tsv
: Normalised counts table with a variance-stabilising transform
differential/
: Directory containing tables of differential statistics reported by differential modules such as DESeq2[contrast_name].deseq2.results.tsv
: Results of DESeq2 differential analyis (RNA-seq)
gsea/
: Directory containing tables of differential gene set analyis from GSEA (where enabled)[contrast]/[contrast].gsea_report_for_[condition].tsv
: A GSEA report table for each side of each contrast
The differential
folder is likely to be the core result set for most users, containing the main tables of differential statistics.
Pipeline information
Output files
pipeline_info/
- Reports generated by Nextflow:
execution_report.html
,execution_timeline.html
,execution_trace.txt
andpipeline_dag.dot
/pipeline_dag.svg
. - Reports generated by the pipeline:
pipeline_report.html
,pipeline_report.txt
andsoftware_versions.yml
. Thepipeline_report*
files will only be present if the--email
/--email_on_fail
parameter’s are used when running the pipeline. - Reformatted samplesheet files used as input to the pipeline:
samplesheet.valid.csv
.
- Reports generated by Nextflow:
Nextflow provides excellent functionality for generating various reports relevant to the running and execution of the pipeline. This will allow you to troubleshoot errors with the running of the pipeline, and also provide you with other information such as launch commands, run times and resource usage.