Description

A tool to raise the quality of viral genomes assembled from short-read metagenomes via resolving and joining of contigs fragmented during de novo assembly.

Input

Name (Type)
Description
Pattern

meta (map)

Groovy Map containing sample information
e.g. [ id:'sample1' ]

fasta (file)

Assembly file (contigs/scaffolds) in FASTA format

*.{fa,fasta,fna,fa.gz,fasta.gz,fna.gz}

coverage (file)

TSV file with 2 columns containing 1) the contig/scaffold id and
2) the coverage depth of the sequence specified in column 1

*.{tsv,txt}

query (file)

File containing the query contigs that the user wants COBRA to extend. This
can be provided in one-column TXT or FASTA format. (The IDs must match the IDs
in the --fasta file exactly)

*.{txt,fasta}

bam (file)

Sorted BAM/CRAM/SAM file resulting from mapping reads used in assembly
to the resulting assembly FASTA

*.{bam,cram,sam}

assembler (string)

The name of the tool used to assemble contigs

{metaspades,megahit,idba}

mink (integer)

The minimum kmer size used to assemble contigs

[0-9]+

maxk (integer)

The maximum kmer size used to assemble contigs

[0-9]+

Output

Name (Type)
Description
Pattern

meta (map)

Groovy Map containing sample information
e.g. [ id:'sample1', single_end:false ]

extended_assemblies (file)

Gzipped FASTA file containing query contigs that are already circular

${prefix}/COBRA_category_i_self_circular.fasta.gz

extended_circular (file)

Gzipped FASTA file containing query contigs that were joined and extended
into a circular genome

${prefix}/COBRA_category_ii-a_extended_circular_unique.fasta.gz

extended_partial (file)

Gzipped FASTA file containing query contigs were joined and extended, but
not into circular sequences

${prefix}/COBRA_category_ii-b_extended_partial_unique.fasta.gz

extended_failed (file)

Gzipped FASTA file containing query contigs that could not be extended due
to COBRA rules

${prefix}/COBRA_category_ii-c_extended_failed.fasta.gz

orphan_end (file)

Gzipped FASTA file containing query contigs that do not shared the
”expected overlap length” with other contigs

${prefix}/COBRA_category_iii_orphan_end.fasta.gz

all_assemblies (file)

Gzipped FASTA file containing all FASTA outputs combined into one file

${prefix}/COBRA_all_assemblies.fasta.gz

joining_summary (file)

TSV file containing information regarding COBRA’s extension results

${prefix}/COBRA_joining_summary.txt

log (file)

Log file containing the contents of each processing

${prefix}/log

versions (file)

File containing software versions

versions.yml

Tools

cobra-meta
MIT

COBRA is a tool to get higher quality viral genomes assembled from metagenomes.