fasta_explore_search_plot_tidk
Uses Telomere Identification toolKit (TIDK) to identify the frequency of telomeric repeats along a sliding window for each sequence in the input fasta file. Results are presented in TSV and SVG formats. The user can specify an a priori sequence for identification. Possible a posteriori sequences are also explored and the most frequent sequence is used for identification similar to the a priori sequence. seqkit/seq and seqkit/sort modules are also included to filter out small sequences and sort sequences by length.
Description
Uses Telomere Identification toolKit (TIDK) to identify the frequency of telomeric repeats along a sliding window for each sequence in the input fasta file. Results are presented in TSV and SVG formats. The user can specify an a priori sequence for identification. Possible a posteriori sequences are also explored and the most frequent sequence is used for identification similar to the a priori sequence. seqkit/seq and seqkit/sort modules are also included to filter out small sequences and sort sequences by length.
Output
Frequency table for the identification of the a priori sequence
Structure: [ val(meta), path(tsv) ]
*.tsv
Frequency graph for the identification of the a priori sequence
Structure: [ val(meta), path(svg) ]
*.svg
The most frequent a posteriori sequence
Structure: [ val(meta), path(txt) ]
*.txt
Frequency table for the identification of the a aposteriori sequence
Structure: [ val(meta), path(tsv) ]
*.tsv
Frequency graph for the identification of the a aposteriori sequence
Structure: [ val(meta), path(svg) ]
*.svg