Define where the pipeline should find input data and save output data.

The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.

type: string

Email address for completion summary.

type: string
pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Set this parameter to your e-mail address to get a summary e-mail with details of the run sent to you when the workflow exits. If set in your user config file (~/.nextflow/config) then you don't need to specify this on the command line for every run.

Define different pipeline parameters.

Joined called and VQSR applied vcf file from the case cohort

type: string

One column text file containing list of samples, one sample ID per line.

type: string

Input files needed for the pipeline from the control dataset, for now we support 3 gnomAD datasets as control, gnomADv2exome, gnomADv4.1exome, gnomADv4.1genome. You need to download these datasets from our Amazon AWS s3 bucket: s3://cocorv-resource-files/

type: string

Resource folder for the annotation tool Annovar, you can download this folder from our Amazon AWS s3 bucket: s3://cocorv-resource-files/

type: string

Resource folder for the annotation tool VEP, you can download this folder from our Amazon AWS s3 bucket: s3://cocorv-resource-files/

type: string

Reference genome build version, allowed values are "GRCh37", "GRCh38". Default value: "GRCh37""

type: string
default: GRCh37

gnomAD version, allowed values are "v2exome", "v4exome", "v4genome" (for GRCh37 data use "v2exome", for GRCh38, use "v4exome" or "v4genome"). Default value: "v2exome""

type: string
default: v2exome

Bed file containing good coverage positions from case vcf files where 90% samples have coverage >= 10.

type: string
default: NA

List of chromosomes you want to analyze, you can test only for chromosome 21 and 22, in that case, it will be "21 22"

type: string
default: 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22

Annotation options for Annovar, default: 'refGene,gnomad211_exome,revel'

type: string
default: refGene,gnomad211_exome,revel

Annotation options for Annovar, default: 'g,f,f'

type: string
default: g,f,f

If you want to add VEP annotation, you need to put "T" for this parameter. Default: "F", meaning do not run VEP annotaion, only run Annovar.

type: string
default: F

VEP annotation parameters

type: string
default: AM,SPLICEAI,CADD,LOFTEE

Annotation options for Annovar

type: string
default: NA

Annotation options for Annovar

type: string
default: NA

File containing estimation of the population/ethnicity of case samples using gnomAD classifier, optional, if not specified nextflow app will estimate the population using gnomAD classifier.

type: string
default: NA

An optional R file with functions or a tab separated two column text file defining the variants of interest

type: string
default: NA

An optional JASON file to provide extra parameters to the custuomized file variantGroupCustom

type: string
default: NA

A one column file without the header listing all required annotations used in variant filtering. AC AN annotations from ACANConfig will be added automatically

type: string
default: NA

Optional options for CoCoRV module

type: string
default: NA

The column header used to pull Gene name from annotation file

type: string
default: Gene.refGene

A one column file specifying the variants to be included, it will also include variants specified in variantExcludeFile

type: string
default: NA

Batch size for CoCoRV module

type: integer
default: 10000

The folder path containing CoCoRV R package

type: string
default: /opt/cocorv/

The maximum of the alternate allele frequency, for gnomADv2exome, we used 0.0001, for gnomADv4.1genome and v4.1exome, we used 0.0005

type: number
default: 0.0001

The maximum missingness allowed for a variant

type: number
default: 0.1

A specified variant group to use for test or a self defined function to define the variants of interest

type: string
default: annovar_pathogenic

The minimum REVEL score for pathogenic missense variants

type: number
default: 0.65

The p-value threshold to detect high LD variants in control

type: number
default: 0.05

Top K genes for generating variant-sample list

type: integer
default: 20

Case or control for which top K genes need to be examined

type: string
default: case

Bed file containing good coverage positions from gnomAD control files where 90% samples have coverage >= 10.

hidden
type: string
default: null/coverage10x.bed.gz

The files containing variant positions needed for gnomAD ancestry prediction classifier

hidden
type: string
default: null/ancestry/hail_positions.chr.pos.tsv

Files needed for gnomAD ancestry prediction classifier

hidden
type: string
default: null/ancestry/gnomad.pca_loadings.ht/

Files needed for gnomAD ancestry prediction classifier

hidden
type: string
default: null/ancestry/gnomad.RF_fit.onnx

The reference genome file

hidden
type: string
default: null/reference.fasta.gz

If you split you joined called VCF file by chromosome, you can supply the VCF file list here, the list needs to be in csv format (comma seperated), the column headers are chr,vcf. Then you need to list the chromosome number and the coresponding vcf file for that chromosome.

type: string
default: NA

Pre-annotated case VCF files, you can use the pre-annotated case VCF files to skip the annotation steps in the pipeline, the list needs to be in csv format (comma seperated), the column headers are chr,vcf. Then you need to list the chromosome number and the coresponding vcf file for that chromosome.

type: string
default: NA

GDS converted genotype case VCF files, you can use it to skip the genotype GDS conversion steps in the pipeline, the list needs to be in csv format (comma seperated), the column headers are chr,gds. Then you need to list the chromosome number and the coresponding gds file for that chromosome.

type: string
default: NA

GDS converted annotated case VCF files, you can use it to skip the annotation GDS conversion steps in the pipeline, the list needs to be in csv format (comma seperated), the column headers are chr,gds. Then you need to list the chromosome number and the coresponding vcf file for that chromosome.

type: string
default: NA

GDS converted genotype control VCF files

hidden
type: string
default: null/controlGenotypeGDS.csv

GDS converted annotated control VCF files

hidden
type: string
default: null/controlAnnotationGDS.csv

Parameters used to describe centralised config profiles. These should not be edited.

Git commit id for Institutional configs.

hidden
type: string
default: master

Base directory for Institutional configs.

hidden
type: string
default: https://raw.githubusercontent.com/nf-core/configs/master

If you're running offline, Nextflow will not be able to fetch the institutional config files from the internet. If you don't need them, then this is not a problem. If you do need them, you should download the files from the repo and tell Nextflow where to find them with this parameter.

Institutional config name.

hidden
type: string

Institutional config description.

hidden
type: string

Institutional config contact information.

hidden
type: string

Institutional config URL link.

hidden
type: string

Less common options for the pipeline, typically set in a config file.

Display version and exit.

hidden
type: boolean

Method used to save pipeline results to output directory.

hidden
type: string

The Nextflow publishDir option specifies which intermediate files should be saved to the output directory. This option tells the pipeline what method should be used to move these files. See Nextflow docs for details.

Email address for completion summary, only when pipeline fails.

hidden
type: string
pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

An email address to send a summary email to when the pipeline is completed - ONLY sent if the pipeline does not exit successfully.

Send plain-text email instead of HTML.

hidden
type: boolean

Do not use coloured log outputs.

hidden
type: boolean

Incoming hook URL for messaging service

hidden
type: string

Incoming hook URL for messaging service. Currently, MS Teams and Slack are supported.

Boolean whether to validate parameters against the schema at runtime

hidden
type: boolean
default: true

Base URL or local path to location of pipeline test dataset files

hidden
type: string
default: https://raw.githubusercontent.com/nf-core/test-datasets/

Suffix to add to the trace report filename. Default is the date and time in the format yyyy-MM-dd_HH-mm-ss.

hidden
type: string