Description

Merge multiple transcriptomes while maintaining source information.

Input

name:type
description
pattern

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end

]

bed:file

bed12 file generated by TAMA collapse

*.bed

filelist:file

list of files

Output

name:type
description
pattern

bed

meta:map

Groovy Map containing sample information e.g. [ id:‘test’ ]

*.bed:file

This is the main merged annotation file. Transcripts are coloured according to the source support for each model. Sources are numbered based on the order supplied in the input filelist file. For example the first file named in the filelist file would have its transcripts coloured in red. If a transcript has multiple sources the colour is shown as magenta.

*.bed

gene_report

meta:map

Groovy Map containing sample information e.g. [ id:‘test’ ]

*_gene_report.txt:file

This contains a report of the genes from the merged file. “num_clusters” refers to the number of source transcripts that were used to make this gene model. “num_final_trans” refers to the number of transcripts in the final gene model.

*_gene_report.txt

merge

meta:map

Groovy Map containing sample information e.g. [ id:‘test’ ]

*_merge.txt:file

This contains a bed12 format file which shows the coordinates of each input transcript matched to the merged transcript ID. I used the “txt” extension even though it is a bed file just to avoid confusion with the main bed file. You can use this file to map the final merged transcript models to their pre-merged supporting transcripts. The 1st subfield in the 4th column shows the final merged transcript ID while the 2nd subfield shows the pre-merged transcript ID with source prefix.

*_merge.txt

trans_report

meta:map

Groovy Map containing sample information e.g. [ id:‘test’ ]

*_trans_report.txt:file

This contains the source information for each merged transcript.

*_trans_report.txt

versions

versions.yml:file

File containing software versions

versions.yml

Tools