rnaseq: Parameters

Define where the pipeline should find input data and save output data.

Path to comma-separated file containing information about the samples in the experiment.

required

type: string

Path to the output directory where the results will be saved.

type: string

default: ./results

File containing SRA/ENA/GEO identifiers one per line in order to download their associated FastQ files.

type: string

Email address for completion summary.

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

MultiQC report title. Printed as page header, used for filename if not otherwise specified.

type: string

Only download metadata for public data database ids and don’t download the FastQ files.

type: boolean

Save FastQ files after merging re-sequenced libraries in the results directory.

type: boolean

Options for processing reads with unique molecular identifiers

Enable UMI-based read deduplication.

type: boolean

UMI pattern to use. Can be either ‘string’ (default) or ‘regex’.

type: string

default: string

The UMI barcode pattern to use e.g. ‘NNNNNN’ indicates that the first 6 nucleotides of the read are from the UMI.

type: string

If this option is specified, intermediate FastQ and BAM files produced by UMI-tools are also saved in the results directory.

type: boolean

Options for filtering reads prior to alignment

Enable the removal of reads derived from ribosomal RNA using SortMeRNA.

type: boolean

Text file containing paths to fasta files (one per line) that will be used to create the database for SortMeRNA.

type: string

default: ${baseDir}/assets/rrna-db-defaults.txt

If this option is specified, intermediate FastQ files containing non-rRNA reads will be saved in the results directory.

type: boolean

Reference genome related files and options required for the workflow.

Name of iGenomes reference.

type: string

Path to FASTA genome file.

type: string

Path to GTF annotation file.

type: string

Path to GFF3 annotation file.

type: string

Path to BED file containing gene intervals. This will be created from the GTF file if not specified.

type: string

Path to FASTA transcriptome file.

type: string

FASTA file to concatenate to genome FASTA file e.g. containing spike-in sequences.

type: string

Splice sites file required for HISAT2.

type: string

Path to directory or tar.gz archive for pre-built STAR index.

type: string

Path to directory or tar.gz archive for pre-built HISAT2 index.

type: string

Path to directory or tar.gz archive for pre-built RSEM index.

type: string

Path to directory or tar.gz archive for pre-built Salmon index.

type: string

Memory passed to HISAT2 build process.

type: integer

default: 200

Specify if your GTF annotation is in GENCODE format.

type: boolean

If generated by the pipeline save the BWA index in the results directory.

type: boolean

Directory / URL base for iGenomes references.

hidden

type: string

default: s3://ngi-igenomes/igenomes/

Do not load the iGenomes reference config.

hidden

type: boolean

Options to adjust read trimming criteria.

Instructs Trim Galore to remove bp from the 5’ end of read 1 (or single-end reads).

type: integer

Instructs Trim Galore to remove bp from the 5’ end of read 2 (paired-end reads only).

type: integer

Instructs Trim Galore to remove bp from the 3’ end of read 1 AFTER adapter/quality trimming has been performed.

type: integer

Instructs Trim Galore to remove bp from the 3’ end of read 2 AFTER adapter/quality trimming has been performed.

type: integer

Instructs Trim Galore to apply the –nextseq=X option, to trim based on quality after removing poly-G tails.

type: integer

Skip the adapter trimming step.

type: boolean

Save the trimmed FastQ files in the results directory.

type: boolean

Options to adjust parameters and filtering criteria for read alignments.

Specifies the alignment algorithm to use - available options are ‘star’, ‘star_rsem’ and ‘hisat2’.

type: string

Specifies the pseudo aligner to use - available options are ‘salmon’. Runs in addition to ‘–aligner’.

type: string

When using pre-built STAR indices do not re-extract and use splice junctions from the GTF file.

type: boolean

Minimum percentage of uniquely mapped reads below which samples are removed from further processing.

type: integer

default: 5

Sequencing center information to be added to read group of BAM files.

type: string

Where possible, save unaligned reads from either STAR, HISAT2 or Salmon to the results directory.

type: boolean

Save the intermediate BAM files from the alignment step.

type: boolean

Skip picard MarkDuplicates step.

type: boolean

Skip all of the alignment-based processes within the pipeline.

type: boolean

Options for counting reads relative to gene features

‘–extraAtributes’ parameter passed to featureCounts.

type: string

default: gene_name

Define the attribute type used to group features in the GTF file.

type: string

default: gene_id

Define the attribute type used to group feature types in the GTF file.

type: string

default: gene_biotype

By default, the pipeline assigns reads based on the ‘exon’ attribute within the GTF file.

type: string

default: exon

Perform reference-guided de novo assembly of transcripts using StringTie i.e. dont restrict to those in GTF file.

type: boolean

Skip additional featureCounts process for biotype QC.

type: boolean

Skip featureCounts.

type: boolean

Options to skip various steps within the workflow.

Specify the RSeQC modules to run.

type: string

default: bam_stat,inner_distance,infer_experiment,junction_annotation,junction_saturation,read_distribution,read_duplication

Use vst transformation instead of rlog with DESeq2.

type: boolean

Skip bigWig file creation.

type: boolean

Skip StringTie.

type: boolean

Skip FastQC.

type: boolean

Skip Preseq.

type: boolean

Skip dupRadar.

type: boolean

Skip Qualimap.

type: boolean

Skip RSeQC.

type: boolean

Skip DESeq2 PCA and heatmap plotting.

type: boolean

Skip MultiQC.

type: boolean

Skip all QC steps except for MultiQC.

type: boolean

Parameters used to describe centralised config profiles. These should not be edited.

Git commit id for Institutional configs.

hidden

type: string

default: master

Base directory for Institutional configs.

hidden

type: string

default: https://raw.githubusercontent.com/nf-core/configs/master

Institutional configs hostname.

hidden

type: string

Institutional config description.

hidden

type: string

Institutional config contact information.

hidden

type: string

Institutional config URL link.

hidden

type: string

Set the top limit for requested resources for any single job.

Maximum number of CPUs that can be requested for any single job.

hidden

type: integer

default: 16

Maximum amount of memory that can be requested for any single job.

hidden

type: string

default: 128.GB

Maximum amount of time that can be requested for any single job.

hidden

type: string

default: 240.h

Less common options for the pipeline, typically set in a config file.

Display help text.

hidden

type: boolean

Method used to save pipeline results to output directory.

hidden

type: string

Email address for completion summary, only when pipeline fails.

hidden

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Send plain-text email instead of HTML.

hidden

type: boolean

File size limit when attaching MultiQC reports to summary emails.

hidden

type: string

default: 25.MB

Do not use coloured log outputs.

hidden

type: boolean

Custom config file to supply to MultiQC.

hidden

type: string

Directory to keep pipeline Nextflow logs and reports.

hidden

type: string

default: ${params.outdir}/pipeline_info

Arguments passed to Nextflow clusterOptions.

hidden

type: string

Run this workflow with Conda. You can also use ‘-profile conda’ instead of providing this parameter.

hidden

type: boolean

nf-core/rnaseq