Description

A submodule that clusters the merged AMP hits generated from ampcombi2/parsetables and ampcombi2/complete using MMseqs2 cluster.

Input

Name (Type)
Description
Pattern

summary_file (file)

The Ampcombi_summary.tsv file generated by running ‘ampcombi complete’. It contains the merged AMP results from all samples and all tools.

*.tsv

Output

Name (Type)
Description
Pattern

cluster_tsv (file)

A file containing all the results from the merged input table ‘Ampcombi_summary.tsv’, with the cluster ID of each hit added. The clustering is done using MMseqs2 cluster.

*.tsv

rep_cluster_tsv (file)

A file containing the representative sequence of each cluster identified by the tool. The clustering is done using MMseqs2 cluster.

*.tsv

log (file)

A log file that captures the standard output of the entire process. Can be activated with --log.

*.log

versions (file)

File containing software versions

versions.yml
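
As an illustration of how the cluster_tsv output might be consumed downstream, the following Python sketch groups the rows of a tab-separated table by cluster ID and counts the members of each cluster. The column names used here (`sample_id`, `contig_id`, `cluster_id`) and the inline example data are assumptions made for this sketch, not taken from the module; check the header of the actual file before adapting it.

```python
import csv
import io
from collections import defaultdict


def group_by_cluster(tsv_text, cluster_column="cluster_id"):
    """Map each cluster ID to the list of rows (as dicts) assigned to it.

    `cluster_column` is a hypothetical column name; adjust it to match
    the header of the real cluster_tsv file.
    """
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    clusters = defaultdict(list)
    for row in reader:
        clusters[row[cluster_column]].append(row)
    return dict(clusters)


# Made-up stand-in for a cluster_tsv file; a real file has more columns.
example = (
    "sample_id\tcontig_id\tcluster_id\n"
    "sample_1\tcontig_a\t0\n"
    "sample_1\tcontig_b\t1\n"
    "sample_2\tcontig_c\t0\n"
)

clusters = group_by_cluster(example)

# Number of hits assigned to each cluster.
sizes = {cluster_id: len(rows) for cluster_id, rows in clusters.items()}
print(sizes)
```

To work with a file on disk, the text passed to `group_by_cluster` could be read with `open(path, newline="").read()`; values are kept as strings, so cluster IDs are not converted to integers.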

Tools

ampcombi2/cluster
MIT

A tool for clustering all AMP hits found across many samples and supporting many AMP prediction tools.