methylseq: Parameters

Define where the pipeline should find input data and save output data.

Path to comma-separated file containing information about the samples in the experiment.

required

type: string

pattern: ^\S+\.csv$

The output directory where the results will be saved. You have to use absolute paths to storage on Cloud infrastructure.

required

type: string

Email address for completion summary.

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

MultiQC report title. Printed as page header, used for filename if not otherwise specified.

type: string

Options for saving a variety of intermediate files

Save reference(s) to results directory

type: boolean

Save aligned intermediates to results directory

type: boolean

Bismark only - Save unmapped reads to FastQ files

type: boolean

Save trimmed reads to results directory.

type: boolean

Options for the reference genome indices used to align reads.

Name of iGenomes reference.

type: string

Path to FASTA genome file

type: string

pattern: ^\S+\.fn?a(sta)?(\.gz)?$

Path to Fasta index file.

type: string

pattern: ^\S+\.fn?a(sta)?.fai$

Path to a directory containing a Bismark reference index.

type: string

bwameth index filename base

type: string

Path to the BWA-MEM index filename base

type: string

Do not load the iGenomes reference config.

hidden

type: boolean

The base path to the igenomes reference files

hidden

type: string

default: s3://ngi-igenomes/igenomes/

Alignment tool to use.

required

type: string

Use BWA-MEM2 algorithm for BWA-Meth indexing and alignment.

type: boolean

Presets for working with specific bisulfite library preparation methods.

Preset for working with TET-assisted pyridine borane sequencing (TAPS) libraries.

type: boolean

Preset for working with PBAT libraries.

type: boolean

Turn on if dealing with MspI digested material.

type: boolean

Run bismark in SLAM-seq mode.

type: boolean

Preset for EM-seq libraries.

type: boolean

Trimming preset for single-cell bisulfite libraries.

type: boolean

Trimming preset for the Accel kit.

type: boolean

Trimming preset for the Zymo kit.

type: boolean

Bisulfite libraries often require additional base pairs to be removed from the ends of the reads before alignment.

In addition to manually specifying bases to be hard-clipped, the workflow has a number of parameter presets:

Parameter	5’ R1 Trim	5’ R2 Trim	3’ R1 Trim	3’ R2 Trim
`--pbat`	8	8	8	8
`--single_cell`	6	6	6	6
`--accel`	10	15	10	10
`--zymo`	10	15	10	10

Note that you can use the --skip_trimming parameter to skip trimming completely. Further, --skip_trimming_presets disables setting any presets for hard-clipping, thereby uncoupling trimming from specific alignment modes entirely

Trim bases from the 5’ end of read 1 (or single-end reads).

type: integer

Trim bases from the 5’ end of read 2 (paired-end only).

type: integer

Trim bases from the 3’ end of read 1 AFTER adapter/quality trimming.

type: integer

Trim bases from the 3’ end of read 2 AFTER adapter/quality trimming

type: integer

Trim bases below this quality value from the 3’ end of the read, ignoring high-quality G bases

type: integer

Discard reads that become shorter than INT because of either quality or adapter trimming.

type: integer

Skip presetting trimming parameters entirely

type: boolean

Parameters specific to the Bismark workflow

Run alignment against all four possible strands.

type: boolean

Output stranded cytosine report, following Bismark’s bismark_methylation_extractor step.

type: boolean

Turn on to relax stringency for alignment (set allowed penalty with –num_mismatches).

type: boolean

0.6 will allow a penalty of bp * -0.6 - for 100bp reads (bismark default is 0.2)

type: number

default: 0.6

Specify a minimum read coverage to report a methylation call

type: integer

Ignore read 2 methylation when it overlaps read 1

type: boolean

default: true

Ignore methylation in first n bases of 5’ end of R1

type: integer

Ignore methylation in first n bases of 5’ end of R2

type: integer

default: 2

Ignore methylation in last n bases of 3’ end of R1

type: integer

Ignore methylation in last n bases of 3’ end of R2

type: integer

default: 2

Supply a .gtf file containing known splice sites (bismark_hisat only).

type: string

pattern: ^\S+\.gtf(\.gz)?$

Allow soft-clipping of reads (potentially useful for single-cell experiments).

type: boolean

Align against a single combined (CT+GA) Bismark index instead of the classic per-strand model.

type: boolean

The minimum insert size for valid paired-end alignments.

type: integer

The maximum insert size for valid paired-end alignments.

type: integer

Sample is NOMe-seq or NMT-seq. Runs coverage2cytosine.

type: boolean

Merges methylation calls for every strand into a single, context dependent file.

type: boolean

Parameters specific to MethylDackel methylation calling step

Call methylation in all three CpG, CHG and CHH contexts.

type: boolean

Merges methylation metrics of the Cytosines in a given context.

type: boolean

Specify a minimum read coverage for MethylDackel to report a methylation call.

type: integer

MethylDackel - ignore SAM flags

type: boolean

Save files for use with methylKit

type: boolean

Parameters specific to the RASTAIR workflow

Nucleotides to exclude from methylation calling on Original Top (strand) reads, following this pattern: R1 start, R1 end, R2 start, R2 end

type: string

default: 0,0,10,0

Nucleotides to exclude from methylation calling on Original Bottom (strand) reads, following this pattern: R1 start, R1 end, R2 start, R2 end

type: string

default: 0,0,10,0

Options for methurator tool

Minimum CpGs coverage to consider for the saturation analysis. Can be a single integer or a list (e.g. 1,3,5).

type: string

default: 1,10,15

Compute confidence intervals using bootstrap replicates.

type: boolean

Maximum extrapolation factor.

type: integer

default: 10

Qualimap configurations

A GFF or BED file containing the target regions which will be passed to Qualimap/Bamqc.

type: string

pattern: ^\S+\.gff|\.bed(\.gz)?$

Targeted sequencing analysis configurations. They need --run_targeted_sequencing to have an effect

A BED file containing the target regions

type: string

pattern: ^\S+|\.bed(\.gz)?$

Run Picard CollectHsMetrics in the targeted analysis

type: boolean

Skip read trimming.

type: boolean

Skip deduplication step after alignment.

type: boolean

Skip FastQC

type: boolean

Skip MultiQC

type: boolean

Run preseq/lcextrap tool

type: boolean

Run methurator/gtestimator tool

type: boolean

Run qualimap/bamqc tool

type: boolean

Run advanced analysis for targeted methylation kits with enrichment of specific regions

type: boolean

Parameters used to describe centralised config profiles. These should not be edited.

Git commit id for Institutional configs.

hidden

type: string

default: master

Base directory for Institutional configs.

hidden

type: string

default: https://raw.githubusercontent.com/nf-core/configs/master

Institutional config name.

hidden

type: string

Institutional config description.

hidden

type: string

Institutional config contact information.

hidden

type: string

Institutional config URL link.

hidden

type: string

Less common options for the pipeline, typically set in a config file.

Display version and exit.

hidden

type: boolean

Method used to save pipeline results to output directory.

hidden

type: string

Email address for completion summary, only when pipeline fails.

hidden

type: string

pattern: ^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$

Send plain-text email instead of HTML.

hidden

type: boolean

File size limit when attaching MultiQC reports to summary emails.

hidden

type: string

default: 25.MB

pattern: ^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$

Do not use coloured log outputs.

hidden

type: boolean

Custom config file to supply to MultiQC.

hidden

type: string

Custom logo file to supply to MultiQC. File name must also be set in the MultiQC config file

hidden

type: string

Custom MultiQC yaml file containing HTML including a methods description.

type: string

Boolean whether to validate parameters against the schema at runtime

hidden

type: boolean

default: true

Base URL or local path to location of pipeline test dataset files

hidden

type: string

default: https://raw.githubusercontent.com/nf-core/test-datasets/methylseq/

Suffix to add to the trace report filename. Default is the date and time in the format yyyy-MM-dd_HH-mm-ss.

hidden

type: string

Display the help message.

type: boolean,string

Display the full detailed help message.

type: boolean

Display hidden parameters in the help message (only works when –help or –help_full are provided).

type: boolean

nf-core/methylseq