Cluster protein sequences using sequence similarity
meta
:map
Groovy Map containing sample information e.g. [ id:'test', single_end:false ]
sequences
:file
FASTA file of sequences to be clustered
*.{fa,fasta}
fasta
*.fasta
FASTA file of the representative sequences for each cluster
*.{fasta}
clusters
*.clstr
Cluster file listing the member sequences of each cluster
*.{clstr}
versions_cdhit
${task.process}
:string
The name of the process
cdhit
The name of the tool
cd-hit -h | sed -n '1s/.*version \([0-9.]*\).*/\1/p'
:eval
The expression to obtain the version of the tool
versions
Clusters and compares protein or nucleotide sequences
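As a rough illustration of the version expression listed above, the sketch below runs the same sed filter on a sample banner line instead of calling cd-hit. The banner text is an assumption about what the first line of `cd-hit -h` looks like, and the names `banner` and `version` are hypothetical; the real output may differ between builds.

```shell
# Assumed sample of the first line printed by `cd-hit -h` (not captured from a real run).
banner='====== CD-HIT version 4.8.1 (built on Jan 1 2020) ======'

# Same sed expression as in the metadata: on line 1 only, keep the run of
# digits and dots that follows "version " and print it.
version=$(printf '%s\n' "$banner" | sed -n '1s/.*version \([0-9.]*\).*/\1/p')

echo "$version"
```

Because the substitution is restricted to line 1 and sed runs with -n, nothing is printed if the banner is not on the first line or does not contain "version " followed by a number.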