Description

The goal of the Shasta long read assembler is to rapidly produce accurate assembled sequence using DNA reads generated by Oxford Nanopore flow cells as input. Please note Assembler is design to focus on speed, so assembly may be considered somewhat non-deterministic as final assembly may vary across executions. See https://github.com/chanzuckerberg/shasta/issues/296.

Input

name:type
description
pattern

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

reads:file

Input file in FASTQ format.

*.{fastq,fastq.gz,fq,fq.gz}

Output

name:type
description
pattern

assembly

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*_Assembly.fasta.gz:file

Assembled FASTA file

${prefix}_Assembly.fasta.gz

gfa

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*_Assembly.gfa.gz:file

Repeat graph

${prefix}_Assembly.gfa.gz

results

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

ShastaRun/:directory

Resulting assembly directory

ShastaRun

versions

versions.yml:file

File containing software versions

versions.yml

Tools

shasta
MIT

Rapidly produce accurate assembled sequence using as input DNA reads generated by Oxford Nanopore flow cells.