Global parameters for the pipeline.

Path to the samplesheet file used by the pipeline. The file should be a CSV, TSV, JSON or YAML file. Equivalent to the sampleAnnotation parameter in the snakemake pipeline.

type: string
pattern: ^\S+\.(csv|tsv|json|yaml|yml)$

You will need to create a design file with information about the samples in your experiment before running the pipeline. Use this parameter to specify its location. See usage docs.

The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure. Equivalent to the root parameter in the snakemake pipeline.

type: string

MultiQC report title. Printed as page header, used for filename if not otherwise specified.

type: string

Title of the project to be displayed on the rendered HTML output

type: string

Name of iGenomes reference. Equivalant to the genomeAssembly parameter in the snakemake pipeline.

type: string

If using a reference genome configured in the pipeline using iGenomes, use this parameter to give the ID for the reference. This is then used to build the full paths for all required reference genome files e.g. --genome GRCh38.

See the nf-core website docs for more details.

Path to FASTA genome file. Equivalent to the genome parameter in the snakemake pipeline.

type: string
pattern: ^\S+\.fn?a(sta)?(\.gz)?$

This parameter is mandatory if --genome is not specified. If you don't have a BWA index available this will be generated for you automatically. Combine with --save_reference to save BWA index for future runs.

Path to FASTA index file.

type: string
pattern: ^\S+\.fai$

Path to NCBI FASTA genome file. Equivalent to the ncbi option in the genome parameter in the snakemake pipeline.

type: string
pattern: ^\S+\.fn?a(sta)?(\.gz)?$

Path to NCBI FASTA index file.

type: string
pattern: ^\S+\.fai$

Path to UCSC FASTA genome file. Equivalent to the ucsc option in the genome parameter in the snakemake pipeline.

type: string
pattern: ^\S+\.fn?a(sta)?(\.gz)?$

Path to UCSC FASTA index file.

type: string
pattern: ^\S+\.fai$

Path to GTF gene annotations file. Equivalent to the geneAnnotation parameter in the snakemake pipeline, with the difference that a file depicting the kev-value pairs is expected here instead.

type: string
pattern: ^\S+\.tsv$

Full path of the file containing HPO terms. If missing, it reads it from our webserver. Refer to files-to-download. Equivalent to the hpoFile parameter in the snakemake pipeline.

type: string
pattern: ^\S+\.tsv$

Export counts options.

A comma-separated list of gene annotations to export.

type: string

A comma-separated list of aberrant expression and aberrant splicing groups whose counts should not be exported. If empty, all groups are exported.

type: string

Aberrant expression options.

Run aberrant expression analysis.

type: boolean

A comma-separated list of aberrant expression groups to run. If empty, all groups are run.

type: string

A positive number indicating the minimum number of samples that a group needs in order to be analyzed. We recommend at least 50.

type: integer
default: 1

A positive number indicating the minimum FPKM value for a gene to be considered expressed.

type: integer
default: 1

Methods to remove sample covariation in OUTRIDER.

type: string

A number between (0, 1] indicating the maximum FDR an event can have in order to be considered an outlier.

type: number
default: 0.05

A non-negative number. Z scores (in absolute value) greater than this cutoff are considered as outliers.

type: number

An integer that controls the maximum value that the encoding dimension can take.

type: integer
default: 3

An integer that sets the batch size for counting reads within a bam file. If memory issues persist lower the yieldSize.

type: integer
default: 2000000

Full path to a yaml file specifying lists of candidate genes per sample to test during FDR correction. See the documentation for details on the structure of this file.

type: string
pattern: ^\S+\.(yaml|yml)$

Aberrant splicing options.

Run aberrant splicing analysis.

type: boolean

A comma-separated list of aberrant splicing groups to run. If empty, all groups are run.

type: string

A positive number indicating the minimum number of samples that a group needs in order to be analyzed. We recommend at least 50.

type: integer
default: 1

If true, it forces samples to be recounted.

type: boolean

Set to true only if counting Nanopore or PacBio long reads.

type: boolean

Set to true if non standard chromosomes are to be kept for further analysis.

type: boolean

If true, no filter is applied. We recommend filtering.

type: boolean

The minimal read count in at least one sample required for an intron to pass the filter.

type: integer
default: 20

The minimum total read count (N) an intron needs to have at the specified quantile across samples to pass the filter. See --as_quantile_for_filtering.

type: integer
default: 10

Defines at which percentile the as_quantile_min_expression filter is applied.

type: number
default: 0.05

A value of 0.95 means that at least 5% of the samples need to have a total read count N >= as_quantile_min_expression to pass the filter.

The minimal variation (in delta psi) required for an intron to pass the filter.

type: number
default: 0.05

Methods to remove sample covariation in FRASER.

type: string

Version of FRASER to use.

type: string
default: FRASER

A non-negative number. Delta psi values greater than this cutoff are considered as outliers. Set to 0.1 when using FRASER2.

type: number
default: 0.3

Same as in aberrant expression.

type: number
default: 0.1

An integer that controls the maximum value that the encoding dimension can take.

type: integer
default: 6

Full path to a yaml file specifying lists of candidate genes per sample to test during FDR correction. See the documentation for details on the structure of this file.

type: string
pattern: ^\S+\.(yaml|yml)$

Mono-allelic expression options.

Run mono-allelic expression analysis.

type: boolean

A comma-separated list of mono-allelic expression groups to run. If empty, all groups are run.

type: string

If true (recommended), it emits the header warnings of a VCF file when performing the allelic counts

type: boolean

Same as in aberrant expression.

type: number
default: 0.05

A number between [0.5, 1) indicating the maximum allelic ratio allele1/(allele1+allele2) for the test to be significant.

type: number
default: 0.8

Skip the addition of the allele frequencies from gnomAD

type: boolean

Maximum allele frequency (of the minor allele) cut-off. Variants with AF equal or below this number are considered rare.

type: number
default: 0.001

Maximum variant frequency among the cohort.

type: number
default: 0.05

Full path to the vcf file used for VCF-BAM matching.

type: string
pattern: ^\S+\.vcf\.gz$

Full path to the index of the vcf file used for VCF-BAM matching.

type: string
pattern: ^\S+\.tbi$

Same as 'groups', but for the VCF-BAM matching

type: string

fraction (0-1) used to seperate 'matching' samples and 'non-matching' samples comparing the DNA and RNA data during QC.

type: number
default: 0.05

Reference genome related files and options required for the workflow.

Do not load the iGenomes reference config.

hidden
type: boolean

Do not load igenomes.config when running the pipeline. You may choose this option if you observe clashes between custom parameters and those supplied in igenomes.config.

The base path to the igenomes reference files

hidden
type: string
default: s3://ngi-igenomes/igenomes/

Parameters used to describe centralised config profiles. These should not be edited.

Git commit id for Institutional configs.

hidden
type: string
default: master

Base directory for Institutional configs.

hidden
type: string
default: https://raw.githubusercontent.com/nf-core/configs/master

If you're running offline, Nextflow will not be able to fetch the institutional config files from the internet. If you don't need them, then this is not a problem. If you do need them, you should download the files from the repo and tell Nextflow where to find them with this parameter.

Institutional config name.

hidden
type: string

Institutional config description.

hidden
type: string

Institutional config contact information.

hidden
type: string

Institutional config URL link.

hidden
type: string

Less common options for the pipeline, typically set in a config file.

Email address for completion summary.

type: string
pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Set this parameter to your e-mail address to get a summary e-mail with details of the run sent to you when the workflow exits. If set in your user config file (~/.nextflow/config) then you don't need to specify this on the command line for every run.

Display version and exit.

hidden
type: boolean

Method used to save pipeline results to output directory.

hidden
type: string

The Nextflow publishDir option specifies which intermediate files should be saved to the output directory. This option tells the pipeline what method should be used to move these files. See Nextflow docs for details.

Email address for completion summary, only when pipeline fails.

hidden
type: string
pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

An email address to send a summary email to when the pipeline is completed - ONLY sent if the pipeline does not exit successfully.

Send plain-text email instead of HTML.

hidden
type: boolean

File size limit when attaching MultiQC reports to summary emails.

hidden
type: string
default: 25.MB
pattern: ^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$

Do not use coloured log outputs.

hidden
type: boolean

Incoming hook URL for messaging service

hidden
type: string

Incoming hook URL for messaging service. Currently, MS Teams and Slack are supported.

Custom config file to supply to MultiQC.

hidden
type: string

Custom logo file to supply to MultiQC. File name must also be set in the MultiQC config file

hidden
type: string

Custom MultiQC yaml file containing HTML including a methods description.

type: string

Boolean whether to validate parameters against the schema at runtime

hidden
type: boolean
default: true

Base URL or local path to location of pipeline test dataset files

hidden
type: string
default: https://raw.githubusercontent.com/nf-core/test-datasets/

Suffix to add to the trace report filename. Default is the date and time in the format yyyy-MM-dd_HH-mm-ss.

hidden
type: string