dualrnaseq: Parameters

Define where the pipeline should find input data and save output data.

Path to comma-separated file containing information about the samples in the experiment.

required

type: string

pattern: ^\S+\.csv$

The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.

required

type: string

Email address for completion summary.

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

MultiQC report title. Printed as page header, used for filename if not otherwise specified.

type: string

Option to generate mapping statistics, creating plots and summaries

type: boolean

To ignore igenomes reference config

type: string

default: genomes,igenomes_base,igenomes_ignore

Reference genome related files and options required for the workflow.

Name of iGenomes reference.

type: string

Path to FASTA genome file.

type: string

pattern: ^\S+\.fn?a(sta)?(\.gz)?$

Do not load the iGenomes reference config.

hidden

type: boolean

default: true

The base path to the igenomes reference files

hidden

type: string

default: s3://ngi-igenomes/igenomes/

The path to the files should be enclosed by quotes ”../..”

default param???

type: string

Change to custom name if desired, ie Human_hela_cells

type: string

default: host

Change to custom name if desired, ie Salmonella_SL1344

type: string

default: pathogen

Host genome fasta file

type: string

Pathogen genome fasta file

type: string

Host GFF file

type: string

Pathogen GFF

type: string

Host transcriptome file

type: string

Pathogen transcriptome file

type: string

By default, the pipeline utilizes FastQC tool for quality control of sequencing reads, run before and after trimming

Define a set of additional fastqc parameters you wish to use, except —quiet —threads —noextract flags which are already specified in the dualrnaseq pipeline

type: string

Adapter and read trimming is performed by either Cutadapt or BBDuk.

Additional parameters if needed

type: string

These parameters are available for Salmon in both Selective Alignment and Alignment-based mode

Options for setting the library type. A = automatic detection

type: string

The pipeline uses gene features from the 3rd column of the host annotative file (gff3) to extract the coordinates of transcripts to be quantified. By default, the pipeline useanscriptome_hosts exon from the —gff_host

type: string

default: exon

The pipeline uses gene features from the 3rd column of the pathogen annotative fikle (gff3) to extract the coordinates of transcripts to be quantified. By default, the pipeline uses features as gene, sRNA, tRNA and rRNA from the —gff_pathogen file.

type: string

default: gene,sRNA,tRNA,rRNA

This flag defines the gene attribute from the 9th column of the host annotative (gff3) file, where the transcript names are extracted. By default, the pipeline extracts transcript_id from the —gff_host file

type: string

default: transcript_id

This flag defines the gene attribute from the 9th column of the pathogen annotative (gff3) file, where transcript, genes or CDS regions are extracted. By default, the pipeline extracts locus_tag from the —gff_pathogen file

type: string

default: locus_tag

Still to be described

type: string

default: exon

Still to be described - requires capital P though

type: string

default: Parent

Parameters listed below are available only for Salmon with Selective Alignment.

Run Salmon selective alignment. Does not need a value, just run —salmon_sa

type: boolean

default: true

Set of additional parameters for creating an index with Salmon Selective Alignment. By default, the kmer size is set at 21. Multiple parameters can be passed - for example: —salmon_sa_index_args=“—keepDuplicates -k 21”.

type: string

default: -k 21

Set of additional parameters for mapping with Salmon Selective Alignment. By default, the pipeline allows soft-clipping of overhanging reads. Multiple parameters can be passed - for example: —salmon_sa_args=“—softclipOverhangs —allowDovetail”

type: string

default: --softclipOverhangs

Options for Alignment-based mode

To run Salmon alignment-based mode

type: boolean

STAR parameter - To create a transcriptome Bam (to use with Salmon)

type: string

default: TranscriptomeSAM

The nf-core/dualrnaseq pipeline runs STAR to generate transcriptomic alignments. By default, it allows for insertions, deletions and soft-clips (Singleend option). To prohibit this behaviour, please specify IndelSoftclipSingleend

type: string

default: Singleend

Define a set of additional salmon quant parameters you wish to use in salmon alignment-based mode.

type: string

Options for STAR genome alignment

To run STAR genome alignment

type: boolean

Quant value in GFF 3rd column

type: string

default: quant

parent attribule in GFF - last column

type: string

default: parent

By default, the pipeline saves unmapped reads within the main BAM file. If you want to switch off this option, set the —outSAMunmapped flag to None

type: string

default: Within

Option to limit RAM when sorting BAM file. If 0, will be set to the genome index size, which can be quite large when running on a desktop or laptop

type: integer

Additional args to pass to STAR

type: string

Options for HTSeq count

To run HTSeq count on a aligned genome file

type: boolean

3rd value of GFF to quantify

type: string

default: quant

Gene attributes from the 9th column in GFF to use

type: string

default: locus_tag

Counting mode: Options of ‘union’, ‘intersection-strict’, ‘intersection-nonempty’

type: string

default: union

For counting non-unique reads. Options of ‘none’, ‘all’, ‘fraction’, ‘random’

type: string

default: none

Read strand orientation. Options of ‘yes’, ‘no’, ‘reverse’

type: string

default: yes

by default the bam file is sorted by position - Important for PE reads

type: string

default: pos

Features from 3rd column in GFF to use

type: string

default: gene

Gene attributes from the 9th column in GFF to use

type: string

default: gene_id

Features from 3rd column in GFF to use

type: string

default: gene,sRNA,tRNA,rRNA

Gene attributes from the 9th column in GFF to use

type: string

default: locus_tag

Additional args to pass to HTSeq-count

type: string

Parameters used to describe centralised config profiles. These should not be edited.

Git commit id for Institutional configs.

hidden

type: string

default: master

Base directory for Institutional configs.

hidden

type: string

default: https://raw.githubusercontent.com/nf-core/configs/master

Institutional config name.

hidden

type: string

Institutional config description.

hidden

type: string

Institutional config contact information.

hidden

type: string

Institutional config URL link.

hidden

type: string

Less common options for the pipeline, typically set in a config file.

Display help text.

hidden

type: boolean

Display version and exit.

hidden

type: boolean

Method used to save pipeline results to output directory.

hidden

type: string

Email address for completion summary, only when pipeline fails.

hidden

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Send plain-text email instead of HTML.

hidden

type: boolean

File size limit when attaching MultiQC reports to summary emails.

hidden

type: string

default: 25.MB

pattern: ^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$

Do not use coloured log outputs.

hidden

type: boolean

Incoming hook URL for messaging service

hidden

type: string

Custom config file to supply to MultiQC.

hidden

type: string

Custom logo file to supply to MultiQC. File name must also be set in the MultiQC config file

hidden

type: string

Custom MultiQC yaml file containing HTML including a methods description.

type: string

Boolean whether to validate parameters against the schema at runtime

hidden

type: boolean

default: true

Base URL or local path to location of pipeline test dataset files

hidden

type: string

default: https://raw.githubusercontent.com/nf-core/test-datasets/

Suffix to add to the trace report filename. Default is the date and time in the format yyyy-MM-dd_HH-mm-ss.

hidden

type: string

Show all params when using --help

hidden

type: boolean