Description

A submodule that parses and standardizes the results from various antimicrobial peptide identification tools.

Input

Name (Type)
Description
Pattern

meta (map)

Groovy Map containing sample information
e.g. [ id:'sample1', single_end:false ]

amp_input (list)

Either the path to the directory containing the AMP tool results for each processed sample, or a list of the files generated by the AMP tools.

[*amptool.tsv, *amptool.tsv]

faa_input (file)

The path to the protein fasta file with the ‘.faa’ extension. The file name must contain the corresponding sample name, e.g. sample_1.faa where “sample_1” is the sample name.

*.faa

gbk_input (file)

The path to the annotated file with either the ‘.gbk’ or the ‘.gbff’ extension. The file name must contain the corresponding sample name, e.g. sample_1.gbk where “sample_1” is the sample name.

*.gbk

opt_amp_db (directory)

The path to the folder containing the fasta and tsv database files.

*/
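
The naming requirement for faa_input and gbk_input (the file name must contain the sample name) can be checked before the module is run. The following is a minimal Python sketch, not part of the module itself: the helper `files_for_sample` and the example file names are hypothetical, and it assumes the sample name is the `id` value of the meta map.

```python
from pathlib import Path


def files_for_sample(sample_id, paths, extensions=(".faa",)):
    """Return the paths whose file name contains sample_id and whose
    suffix is one of the given extensions (hypothetical helper)."""
    return [
        Path(p)
        for p in paths
        if sample_id in Path(p).name and Path(p).suffix in extensions
    ]


# Python stand-in for the Groovy meta map shown in the table above.
meta = {"id": "sample_1", "single_end": False}

# Hypothetical input files for illustration only.
candidates = ["sample_1.faa", "sample_2.faa", "sample_1.gbk", "sample_1.gbff"]

faa_matches = files_for_sample(meta["id"], candidates, (".faa",))
gbk_matches = files_for_sample(meta["id"], candidates, (".gbk", ".gbff"))
```

Note that this is a plain substring check, so a sample named sample_1 would also match a file called sample_10.faa. Whether the tool itself matches names more strictly is not stated here.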

Output

Name (Type)
Description
Pattern

meta (map)

Groovy Map containing sample information
e.g. [ id:'sample1', single_end:false ]

sample_dir (directory)

The output directory that contains the summary output and related alignment files for one sample.

/*

contig_gbks (directory)

The output subdirectory that contains the gbk files containing the AMP hits for each sample.

/*/contig_gbks

txt (file)

An alignment file containing the results from the DIAMOND alignment step done on all AMP hits.

/*/*_diamond_matches.txt

tsv (file)

A file containing the summary report of all predicted AMP hits from all AMP tools given as input, the corresponding taxonomic and functional classification from the alignment step and the estimated physicochemical properties.

/*/*_ampcombi.tsv

faa (file)

A fasta file containing the amino acid sequences of all predicted AMP hits.

/*/*_amp.faa

sample_log (file)

A log file that captures the standard output per sample. Can be activated by --log.

/*/*.log

full_log (file)

A log file that captures the standard output for the entire process. Can be activated by --log.

Ampcombi_parse_tables.log

results_db (directory)

If the AMP reference database is not provided by the user using the flag `--amp_database`, by default the DRAMP database will be downloaded, filtered and stored in this folder.

/amp_ref_database

results_db_dmnd (file)

AMP reference database converted to DIAMOND database format.

/amp_ref_database/*.dmnd

results_db_fasta (file)

AMP reference database fasta file, cleaned of DIAMOND-incompatible characters.

/amp_ref_database/*.clean.fasta

results_db_tsv (file)

AMP reference database in tsv format, with two columns containing the header and the sequence.

/amp_ref_database/*.tsv

versions (file)

File containing software versions

versions.yml
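
The per-sample patterns listed above can be used to locate the outputs after a run. The following is a minimal Python sketch under stated assumptions: the patterns are taken relative to a results directory, the function `collect_outputs` is hypothetical, and the sample folder and file names in the demonstration are made up to fit the patterns.

```python
import tempfile
from pathlib import Path

# Glob patterns from the Output table, written relative to the results directory.
OUTPUT_PATTERNS = {
    "contig_gbks": "*/contig_gbks",
    "txt": "*/*_diamond_matches.txt",
    "tsv": "*/*_ampcombi.tsv",
    "faa": "*/*_amp.faa",
    "sample_log": "*/*.log",
}


def collect_outputs(results_dir):
    """Map each output name to the sorted list of paths matching its pattern."""
    root = Path(results_dir)
    return {
        name: sorted(root.glob(pattern))
        for name, pattern in OUTPUT_PATTERNS.items()
    }


# Demonstration on a throwaway directory with made-up file names.
with tempfile.TemporaryDirectory() as tmp:
    sample_dir = Path(tmp) / "sample_1"
    (sample_dir / "contig_gbks").mkdir(parents=True)
    for file_name in (
        "sample_1_ampcombi.tsv",
        "sample_1_amp.faa",
        "sample_1_diamond_matches.txt",
    ):
        (sample_dir / file_name).touch()

    found = collect_outputs(tmp)
    found_names = {name: [p.name for p in paths] for name, paths in found.items()}
```

In this demonstration no log file is created, so the sample_log entry stays empty, which mirrors a run without --log.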

Tools

ampcombi2/parsetables
MIT

A parsing tool to convert and summarise the outputs from multiple AMP detection tools in a standardized format.