Description

Subworkflow that generates a clustering TSV file from a set of amino acid sequences in a FASTA file.

Input

| name | description | pattern |
| --- | --- | --- |
| sequences | FASTA file containing the sequences to cluster. | |
| clustering_tool | MMseqs2 clustering algorithm to use for cluster generation. Options are ‘linclust’ or ‘cluster’. | |

Output

| name | description | pattern |
| --- | --- | --- |
| versions | Versions file containing the software versions used in the workflow. | |
| seqs | The input FASTA file mapped to its corresponding clustering TSV, per input entry. | |
| clusters | Clustering file mapped to its corresponding input FASTA file. | |
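
As a rough illustration of how the clustering TSV might be consumed downstream, the sketch below groups sequence IDs by cluster. It assumes the two-column layout that MMseqs2 typically writes for clustering results (cluster representative, then member ID, tab-separated); this page does not state the layout, so check it against your own output. The function name `read_clusters` and the example IDs are hypothetical.

```python
import csv
import io
from collections import defaultdict


def read_clusters(handle):
    """Group member IDs by cluster representative.

    Assumes each row is `representative<TAB>member` (an assumption,
    not something this page specifies).
    """
    clusters = defaultdict(list)
    for row in csv.reader(handle, delimiter="\t"):
        if len(row) < 2:
            continue  # skip blank or malformed rows
        representative, member = row[0], row[1]
        clusters[representative].append(member)
    return dict(clusters)


# Hypothetical example content; real IDs come from the input FASTA headers.
example_tsv = "seqA\tseqA\nseqA\tseqB\nseqC\tseqC\n"
clusters = read_clusters(io.StringIO(example_tsv))
for representative, members in clusters.items():
    print(representative, len(members), ",".join(members))
```

For a real run, pass an open file handle for the TSV emitted in `clusters` instead of the in-memory example.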