Description

Cluster protein sequences using sequence similarity

Input

name:type
description
pattern

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

sequences

:file

fasta file of sequences to be clustered

*.{fa,fasta}

Output

name:type
description
pattern

fasta

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.fasta

:file

fasta file of the representative sequences for each cluster

*.{fasta}

clusters

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.clstr

:file

List of clusters

*.{clstr}

versions_cdhit

${task.process}

:string

The name of the process

cdhit

:string

The name of the tool

cd-hit -h | sed -n '1s/.*version \([0-9.]*\).*/\1/p'

:eval

The expression to obtain the version of the tool

Topics

name:type
description
pattern

versions

${task.process}

:string

The name of the process

cdhit

:string

The name of the tool

cd-hit -h | sed -n '1s/.*version \([0-9.]*\).*/\1/p'

:eval

The expression to obtain the version of the tool