Description

WARNING - this tool is still experimental and shouldn’t be used in a production setting. Gathers paired-end and split read evidence files for use in the GATK-SV pipeline. Output files are a file containing the location of and orientation of read pairs marked as discordant, and a file containing the clipping location of all soft clipped reads and the orientation of the clipping.

Input

name:type
description
pattern

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end

]

evidence_files:file

The evidence files created by CollectSVEvidence. They all need to be of the same type to print the SV evidence.

*.{pe,sr,baf,rd}.txt(.gz)

evidence_indices:file

The indices of the evidence files created by CollectSVEvidence

*.{pe,sr,baf,rd}.txt(.gz).tbi

bed:file

An optional BED file

*.bed

fasta:file

An optional reference FASTA file

*.{fa,fasta}

fasta_fai:file

An optional reference FASTA file index

*.fai

dict:file

The mandatory sequence dictionary file

*.dict

Output

name:type
description
pattern

printed_evidence

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end

]

*.txt.gz:file

The output file containing the discordant read pairs or the soft clipped reads

*.{pe,sr,baf,rd}.txt.gz

printed_evidence_index

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end

]

*.txt.gz.tbi:file

The index file of the output compressed text file containing the discordant read pairs or the soft clipped reads

*.{pe,sr,baf,rd}.txt.gz.tbi

versions

versions.yml:file

File containing software versions

versions.yml

Tools

gatk4
Apache-2.0

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.