Description

Apply a score cutoff to filter variants based on a recalibration table. AplyVQSR performs the second pass in a two-stage process called Variant Quality Score Recalibration (VQSR). Specifically, it applies filtering to the input variants based on the recalibration table produced in the first step by VariantRecalibrator and a target sensitivity value.

Input

name:type
description
pattern

meta

:map

Groovy Map containing sample information e.g. [ id:‘test’]

vcf

:file

VCF file to be recalibrated, this should be the same file as used for the first stage VariantRecalibrator.

*.vcf

vcf_tbi

:file

tabix index for the input vcf file.

*.vcf.tbi

recal

:file

Recalibration file produced when the input vcf was run through VariantRecalibrator in stage 1.

*.recal

recal_index

:file

Index file for the recalibration file.

.recal.idx

tranches

:file

Tranches file produced when the input vcf was run through VariantRecalibrator in stage 1.

.tranches

fasta

:file

The reference fasta file

*.fasta

fai

:file

Index of reference fasta file

*.fasta.fai

dict

:file

GATK sequence dictionary

*.dict

Output

name:type
description
pattern

vcf

meta

:file

compressed vcf file containing the recalibrated variants.

*.vcf.gz

*.vcf.gz

:file

compressed vcf file containing the recalibrated variants.

*.vcf.gz

tbi

meta

:file

compressed vcf file containing the recalibrated variants.

*.vcf.gz

*.tbi

:file

Index of recalibrated vcf file.

*vcf.gz.tbi

versions_gatk4

${task.process}

:string

The name of the process

gatk4

:string

The name of the tool

gatk --version | sed -n '/GATK.*v/s/.*v//p'

:eval

The expression to obtain the version of the tool

Topics

name:type
description
pattern

versions

${task.process}

:string

The name of the process

gatk4

:string

The name of the tool

gatk --version | sed -n '/GATK.*v/s/.*v//p'

:eval

The expression to obtain the version of the tool

Tools

gatk4
Apache-2.0

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.