Cluster protein sequences using sequence similarity
meta{:bash}
:map
Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]
sequences{:bash}
:file
fasta file of sequences to be clustered
*.{fa,fasta}
fasta{:bash}
*.fasta{:bash}
fasta file of the representative sequences for each cluster
*.{fasta}
clusters{:bash}
*.clstr{:bash}
List of clusters
*.{clstr}
versions_cdhit{:bash}
${task.process}{:bash}
:string
The name of the process
cdhit{:bash}
The name of the tool
cd-hit -h | sed -n '1s/.*version \([0-9.]*\).*/\1/p'{:bash}
:eval
The expression to obtain the version of the tool
versions{:bash}
Clusters and compares protein or nucleotide sequences