airrflow: Parameters

Define where the pipeline should find input data and save output data.

Specify the subworkflow to be executed.

hidden

type: string

Path to a tsv file providing paths to the fastq files for each sample and the necessary metadata for the analysis.

required

type: string

The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.

required

type: string

Email address for completion summary.

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Experimental protocol used to generate the data

Protocol used for the V(D)J amplicon sequencing library generation.

type: string

Path to fasta file containing the linker sequence, if no V-region primers were used but a linker sequence is present (e.g. 5’ RACE SMARTer TAKARA protocol).

type: string

Define the paths to the igblast and IMGT databases if you have them cached.

Path to the cached igblast database.

type: string

Path to the cached igblast database.

type: string

Save databases so you can use the cache in future runs.

type: boolean

Define the primer region start and how to deal with the primer alignment.

Path to a fasta file containinc the V-region primer sequences.

type: string

Path to a fasta file containing the C-region primer sequences.

type: string

Start position of V region primers (without counting the UMI barcode).

type: integer

Start position of C region primers (without counting the UMI barcode).

type: integer

Indicate if C region primers are in the R1 or R2 reads.

type: string

Specify to match the tail-end of the sequence against the reverse complement of the primers. This also reverses the behavior of the –start argument, such that start position is relative to the tail-end of the sequence. (default: False)Maximum scoring error for the Presto MaxPrimer process for the C and/or V region primers identification.

type: boolean

Define how UMI barcodes should be treated.

Indicate if UMI indices are recorded in a separate index file.

type: boolean

Indicate if UMI indices are recorded in the R1 (default) or R1 fastq file.

type: string

UMI barcode length in nucleotides. Set to 0 if no UMIs present.

type: integer

default: -1

UMI barcode start position in the index read.

type: integer

Options for adapter trimming and read clipping

Whether to trim adapters in fastq reads with fastp.

type: boolean

default: true

type: string

Number of bases to clip 5’ in R1 reads.

type: integer

Number of bases to clip 5’ in R2 reads.

type: integer

Number of bases to clip 3’ in R1 reads.

type: integer

Number of bases to clip 3’ in R2 reads.

type: integer

Trim adapters specific for Nextseq sequencing

type: boolean

Option to save trimmed reads.

type: boolean

Options for the presto tools

Quality threshold for Presto FilterSeq sequence filtering.

type: integer

default: 20

Maximum primer scoring error in the Presto MaskPrimer step for the C and/or V region primers identification.

type: number

default: 0.2

Maximum error for building the primer consensus in the Presto Buildconsensus step.

type: number

default: 0.6

Masking mode for the Presto MaskPrimer step. Available: cut, mask, trim, tag.

type: string

Maximum error for building the sequence consensus in the Presto BuildConsensus step.

type: number

default: 0.1

Maximum gap for building the sequence consensus in the Presto BuildConsensus step.

type: number

default: 0.5

Cluster sequences by similarity regardless of any annotation with Presto ClusterSets and annotate the cluster ID additionally to the UMI barcode.

type: boolean

default: true

Define how the B-cell clonal trees should be calculated.

Set to true if to manually adjust the clustering threshold for cell clones.

type: boolean

Set the clustering threshold Hamming distance value.

type: number

default: 0.14

Set the method for finding the clustering threshold.

type: string

default: density

Define downstream analysis options.

Skip repertoire analysis and report generation

type: boolean

Skip clonal lineage analysis and lineage tree plotting.

type: boolean

Skip multiqc report

type: boolean

Custom report Rmarkdown file.

type: string

default: ${projectDir}/assets/repertoire_comparison.Rmd

Custom report style file in css format.

type: string

default: ${projectDir}/assets/nf-core_style.css

Custom logo for the report.

type: string

default: ${projectDir}/assets/nf-core-airrflow_logo_light.png

Options for software packaging

Enable conda to run pipeline with conda environment.

type: boolean

Options for the reference genome indices used to align reads.

Directory / URL base for iGenomes references.

hidden

type: string

default: s3://ngi-igenomes/igenomes

Do not load the iGenomes reference config.

hidden

type: boolean

default: true

Parameters used to describe centralised config profiles. These should not be edited.

Git commit id for Institutional configs.

hidden

type: string

default: master

Base directory for Institutional configs.

hidden

type: string

default: https://raw.githubusercontent.com/nf-core/configs/master

Institutional config name.

hidden

type: string

Institutional config description.

hidden

type: string

Institutional config contact information.

hidden

type: string

Directory to keep pipeline Nextflow logs and reports.

hidden

type: string

default: ${params.outdir}/pipeline_info

Set the top limit for requested resources for any single job.

Maximum number of CPUs that can be requested for any single job.

hidden

type: integer

default: 16

Maximum amount of memory that can be requested for any single job.

hidden

type: string

default: 128.GB

pattern: ^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$

Maximum amount of time that can be requested for any single job.

hidden

type: string

default: 240.h

pattern: ^(\d+\.?\s*(s|m|h|day)\s*)+$

Less common options for the pipeline, typically set in a config file.

Display help text.

hidden

type: boolean

Method used to save pipeline results to output directory.

hidden

type: string

Email address for completion summary, only when pipeline fails.

hidden

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Send plain-text email instead of HTML.

hidden

type: boolean

MultiQC report title. Printed as page header, used for filename if not otherwise specified.

type: string

File size limit when attaching MultiQC reports to summary emails.

hidden

type: string

default: 25.MB

pattern: ^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$

Do not use coloured log outputs.

hidden

type: boolean

Incoming hook URL for messaging service

hidden

type: string

Custom config file to supply to MultiQC.

hidden

type: string

Custom logo file to supply to MultiQC. File name must also be set in the MultiQC config file

hidden

type: string

Custom MultiQC yaml file containing HTML including a methods description.

type: string

Directory to keep pipeline Nextflow logs and reports.

hidden

type: string

default: ${params.outdir}/pipeline_info

Boolean whether to validate parameters against the schema at runtime

hidden

type: boolean

default: true

Show all params when using --help

hidden

type: boolean

Arguments for this subworkflow

Name of the field used to collapse duplicated sequences

type: string

default: filename,cell_id

Name of the field used to group data files to identify clones

type: string

default: subject_id

Whether to reassign genes if the input file is an AIRR formatted tabulated file

type: boolean

default: true

Subset to productive sequences

type: boolean

default: true

Whether to apply the chimera removal filter

type: boolean

default: true

Use auto to automatically set a threshold to identify clonally related sequences. Set

type: string,number

default: auto

Path to MiAIRR-BioSample mapping

type: string

default: bcellmagic/assets/reveal/mapping_MiAIRR_BioSample_v1.3.1.tsv

Whether input samples include single cell sequencing samples

type: string

default: single_cell

nf-core/airrflow