nf-core/nanoseq
Nanopore demultiplexing, QC and alignment pipeline
2.0.0
). The latest
stable release is
3.1.0
.
Define where the pipeline should find input data and save output data.
Path to comma-separated file containing information about the samples in the experiment.
string
./samplesheet.csv
^\S+\.csv$
You will need to create a design file with information about the samples in your experiment before running the pipeline. Use this parameter to specify its location. It has to be a comma-separated file with 3 columns, and a header row. See usage docs.
Path to comma-separated file containing information about the samples in the experiment.
string
You will need to create a design file with information about the samples in your experiment before running the pipeline. Use this parameter to specify its location. It has to be a comma-separated file with 3 columns, and a header row. See usage docs.
Path to the output directory where the results will be saved.
string
./results
Email address for completion summary.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
Set this parameter to your e-mail address to get a summary e-mail with details of the run sent to you when the workflow exits. If set in your user config file (~/.nextflow/config
) then you don't need to specify this on the command line for every run.
MultiQC report title. Printed as page header, used for filename if not otherwise specified.
string
Options required to basecall and demultiplex samples.
Path to Nanopore run directory (e.g. 'fastq_pass/') or a basecalled fastq file that requires demultiplexing. The latter can only be provided in conjunction with the '--skip_basecalling' parameter.
string
Flowcell used to perform the sequencing e.g. 'FLO-MIN106'. Not required if '--guppy_config' is specified.
string
Kit used to perform the sequencing e.g. 'SQK-LSK109'. Not required if '--guppy_config' is specified.
string
Barcode kit used to perform the sequencing e.g. 'SQK-PBK004'.
string
If you would like to skip the basecalling (--skip_basecalling
) but still perform the demultiplexing please specify a barcode kit that can be recognised by qcat:
| qcat
barcode kit specifications | description |
|-----------------------------------|-------------------------------------------------------------------------------|
| Auto
| Auto detect barcoding kit |
| RBK001
| Rapid barcoding kit |
| RBK004
| Rapid barcoding kit v4 |
| NBD103/NBD104
| Native barcoding kit with barcodes 1-12 |
| NBD114
| Native barcoding kit with barcodes 13-24 |
| NBD104/NBD114
| Native barcoding kit with barcodes 1-24 |
| PBC001
| PCR barcoding kits with 12 barcodes |
| PBC096
| PCR barcoding kits with 96 barcodes |
| RPB004/RLB001
| Rapid PCR Barcoding Kit (SQK-RPB004) and Rapid Low Input by PCR Barcoding Kit |
| RPB004/LWB001
| Low Input by PCR Barcoding Kit |
| RAB204
| 16S Rapid Amplicon Barcoding Kit with 12 Barcodes |
| VMK001
| Voltrax Barcoding Kit with 4 barcodes |
Require barcode on both ends for Guppy basecaller.
boolean
Config file used for basecalling that will be passed to Guppy via the '--config' parameter.
string
Cannot be used in conjunction with --flowcell
and --kit
. This can be a local file (e.g. /your/dir/guppy_conf.cfg
) or a string specifying a configuration stored in the /opt/ont/guppy/data/
directory of Guppy.
Custom basecalling model file in json
format that will be passed to Guppy via the '--model' parameter.
string
Custom basecalling models can be trained with software such as Taiyaki. This can also be a string specifying a model stored in the /opt/ont/guppy/data
directory of Guppy.
Whether to demultiplex with Guppy in GPU mode.
boolean
Number of '--gpu_runners_per_device' used for Guppy when using '--guppy_gpu'.
integer
6
Number of '--cpu_threads_per_caller' used for Guppy when using '--guppy_gpu'.
integer
1
Basecalling device specified to Guppy in GPU mode using '--device'.
string
auto
Cluster options required to use GPU resources (e.g. '--part=gpu --gres=gpu:1').
string
Specify the minimum quality score for qcat in the range 0-100.
integer
60
Search for adapters in the whole read by applying the '--detect-middle' parameter in qcat.
boolean
Skip basecalling with Guppy.
boolean
Skip demultiplexing with Guppy/qcat.
boolean
Filter reads from FastQ files using NanoLyse
boolean
Fasta file to be filtered against using NanoLyse
undefined
Options to adjust parameters and filtering criteria for read alignments.
Specifies the aligner to use i.e. 'minimap2' or 'graphmap2'.
undefined
minimap2
Specifies if the data is strand-specific. Automatically activated when using '--protocol directRNA'.
boolean
When using --protocol
/--stranded
the following command-line arguments will be set for minimap2
and graphmap2
:
| nanoseq
input | minimap2
presets | graphmap2
presets |
|------------------------------|---------------------|---------------------|
| --protocol DNA
| -ax map-ont | no presets |
| --protocol cDNA
| -ax splice | -x rnaseq |
| --protocol directRNA
| -ax splice -uf -k14 | -x rnaseq |
| --protocol cDNA --stranded
| -ax splice -uf | -x rnaseq |
Save the '.sam' files from the alignment step - not done by default.
boolean
Skip alignment and downstream processes.
undefined
Options to adjust quantification and differential analysis
Specifies the transcript quantification method to use (available are: bambu or stringtie2). Only available when protocol is cDNA or directRNA.
undefined
bambu
Skip transcript quantification and differential analysis.
boolean
Skip differential analysis with DESeq2 and DEXSeq.
boolean
Options to skip various steps within the workflow.
Skip BigBed file generation.
boolean
Skip BigWig file generation.
boolean
Skip pycoQC.
boolean
Skip NanoPlot.
boolean
Skip FastQC.
boolean
Skip MultiQC.
boolean
Skip all QC steps apart from MultiQC.
boolean
Reference genome related files and options required for the workflow.
Directory / URL base for iGenomes references.
string
s3://ngi-igenomes/igenomes
Do not load the iGenomes reference config.
boolean
Do not load igenomes.config
when running the pipeline. You may choose this option if you observe clashes between custom parameters and those supplied in igenomes.config
.
Parameters used to describe centralised config profiles. These should not be edited.
Git commit id for Institutional configs.
string
master
Base directory for Institutional configs.
string
https://raw.githubusercontent.com/nf-core/configs/master
If you're running offline, Nextflow will not be able to fetch the institutional config files from the internet. If you don't need them, then this is not a problem. If you do need them, you should download the files from the repo and tell Nextflow where to find them with this parameter.
Institutional configs hostname.
string
Institutional config name.
string
Institutional config description.
string
Institutional config contact information.
string
Institutional config URL link.
string
Set the top limit for requested resources for any single job.
Maximum number of CPUs that can be requested for any single job.
integer
16
Use to set an upper-limit for the CPU requirement for each process. Should be an integer e.g. --max_cpus 1
Maximum amount of memory that can be requested for any single job.
string
128.GB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Use to set an upper-limit for the memory requirement for each process. Should be a string in the format integer-unit e.g. --max_memory '8.GB'
Maximum amount of time that can be requested for any single job.
string
240.h
^(\d+\.?\s*(s|m|h|day)\s*)+$
Use to set an upper-limit for the time requirement for each process. Should be a string in the format integer-unit e.g. --max_time '2.h'
Less common options for the pipeline, typically set in a config file.
Display help text.
boolean
Method used to save pipeline results to output directory.
string
The Nextflow publishDir
option specifies which intermediate files should be saved to the output directory. This option tells the pipeline what method should be used to move these files. See Nextflow docs for details.
Email address for completion summary, only when pipeline fails.
string
^([a-zA-Z0-9_\-\.]+)@([a-zA-Z0-9_\-\.]+)\.([a-zA-Z]{2,5})$
An email address to send a summary email to when the pipeline is completed - ONLY sent if the pipeline does not exit successfully.
Send plain-text email instead of HTML.
boolean
File size limit when attaching MultiQC reports to summary emails.
string
25.MB
^\d+(\.\d+)?\.?\s*(K|M|G|T)?B$
Do not use coloured log outputs.
boolean
Custom config file to supply to MultiQC.
string
Directory to keep pipeline Nextflow logs and reports.
string
${params.outdir}/pipeline_info
Boolean whether to validate parameters against the schema at runtime
boolean
true
Show all params when using --help
boolean
Run this workflow with Conda. You can also use '-profile conda' instead of providing this parameter.
boolean
Instead of directly downloading Singularity images for use with Singularity, force the workflow to pull and convert Docker containers instead.
boolean
This may be useful for example if you are unable to directly pull Singularity containers to run the pipeline due to http/https proxy issues.