Description

A set of tools written in Perl and C++ for working with VCF files

Input

name:type
description
pattern

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

variant_file{:bash}

:file

variant input file which can be vcf, vcf.gz, or bcf format.

bed{:bash}

:file

bed file which can be used with different arguments in vcftools (optional)

diff_variant_file{:bash}

:file

secondary variant file which can be used with the ‘diff’ suite of tools (optional)

Output

name:type
description
pattern

vcf{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.vcf{:bash}

:file

vcf file (optional)

*.vcf

bcf{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.bcf{:bash}

:file

bcf file (optional)

*.bcf

frq{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.frq{:bash}

:file

Allele frequency for each site (optional)

*.frq

frq_count{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.frq.count{:bash}

:file

Allele counts for each site (optional)

*.frq.count

idepth{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.idepth{:bash}

:file

mean depth per individual (optional)

*.idepth

ldepth{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.ldepth{:bash}

:file

depth per site summed across individuals (optional)

*.ildepth

ldepth_mean{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.ldepth.mean{:bash}

:file

mean depth per site calculated across individuals (optional)

*.ldepth.mean

gdepth{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.gdepth{:bash}

:file

depth for each genotype in vcf file (optional)

*.gdepth

hap_ld{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.hap.ld{:bash}

:file

r2, D, and D’ statistics using phased haplotypes (optional)

*.hap.ld

geno_ld{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.geno.ld{:bash}

:file

squared correlation coefficient between genotypes encoded as 0, 1 and 2 to represent the number of non-reference alleles in each individual (optional)

*.geno.ld

geno_chisq{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.geno.chisq{:bash}

:file

test for genotype independence via the chi-squared statistic (optional)

*.geno.chisq

list_hap_ld{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.list.hap.ld{:bash}

:file

r2 statistics of the sites contained in the provided input file verses all other sites (optional)

*.list.hap.ld

list_geno_ld{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.list.geno.ld{:bash}

:file

r2 statistics of the sites contained in the provided input file verses all other sites (optional)

*.list.geno.ld

interchrom_hap_ld{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.interchrom.hap.ld{:bash}

:file

r2 statistics for sites (haplotypes) on different chromosomes (optional)

*.interchrom.hap.ld

interchrom_geno_ld{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.interchrom.geno.ld{:bash}

:file

r2 statistics for sites (genotypes) on different chromosomes (optional)

*.interchrom.geno.ld

tstv{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.TsTv{:bash}

:file

Transition / Transversion ratio in bins of size defined in options (optional)

*.TsTv

tstv_summary{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.TsTv.summary{:bash}

:file

Summary of all Transitions and Transversions (optional)

*.TsTv.summary

tstv_count{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.TsTv.count{:bash}

:file

Transition / Transversion ratio as a function of alternative allele count (optional)

*.TsTv.count

tstv_qual{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.TsTv.qual{:bash}

:file

Transition / Transversion ratio as a function of SNP quality threshold (optional)

*.TsTv.qual

filter_summary{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.FILTER.summary{:bash}

:file

Summary of the number of SNPs and Ts/Tv ratio for each FILTER category (optional)

*.FILTER.summary

sites_pi{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.sites.pi{:bash}

:file

Nucleotide divergency on a per-site basis (optional)

*.sites.pi

windowed_pi{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.windowed.pi{:bash}

:file

Nucleotide diversity in windows, with window size determined by options (optional)

*windowed.pi

weir_fst{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.weir.fst{:bash}

:file

Fst estimate from Weir and Cockerham’s 1984 paper (optional)

*.weir.fst

heterozygosity{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.het{:bash}

:file

Heterozygosity on a per-individual basis (optional)

*.het

hwe{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.hwe{:bash}

:file

Contains the Observed numbers of Homozygotes and Heterozygotes and the corresponding Expected numbers under HWE (optional)

*.hwe

tajima_d{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.Tajima.D{:bash}

:file

Tajima’s D statistic in bins with size of the specified number in options (optional)

*.Tajima.D

freq_burden{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.ifreqburden{:bash}

:file

Number of variants within each individual of a specific frequency in options (optional)

*.ifreqburden

lroh{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.LROH{:bash}

:file

Long Runs of Homozygosity (optional)

*.LROH

relatedness{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.relatedness{:bash}

:file

Relatedness statistic based on the method of Yang et al, Nature Genetics 2010 (doi:10.1038/ng.608) (optional)

*.relatedness

relatedness2{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.relatedness2{:bash}

:file

Relatedness statistic based on the method of Manichaikul et al., BIOINFORMATICS 2010 (doi:10.1093/bioinformatics/btq559) (optional)

*.relatedness2

lqual{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.lqual{:bash}

:file

per-site SNP quality (optional)

*.lqual

missing_individual{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.imiss{:bash}

:file

Missingness on a per-individual basis (optional)

*.imiss

missing_site{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.lmiss{:bash}

:file

Missingness on a per-site basis (optional)

*.lmiss

snp_density{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.snpden{:bash}

:file

Number and density of SNPs in bins of size defined by option (optional)

*.snpden

kept_sites{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.kept.sites{:bash}

:file

All sites that have been kept after filtering (optional)

*.kept.sites

removed_sites{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.removed.sites{:bash}

:file

All sites that have been removed after filtering (optional)

*.removed.sites

singeltons{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.singletons{:bash}

:file

Location of singletons, and the individual they occur in (optional)

*.singeltons

indel_hist{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.indel.hist{:bash}

:file

Histogram file of the length of all indels (including SNPs) (optional)

*.indel_hist

hapcount{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.hapcount{:bash}

:file

Unique haplotypes within user specified bins (optional)

*.hapcount

mendel{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.mendel{:bash}

:file

Mendel errors identified in trios (optional)

*.mendel

format{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.FORMAT{:bash}

:file

Extracted information from the genotype fields in the VCF file relating to a specified FORMAT identifier (optional)

*.FORMAT

info{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.INFO{:bash}

:file

Extracted information from the INFO field in the VCF file (optional)

*.INFO

genotypes_matrix{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.012{:bash}

:file

Genotypes output as large matrix. Genotypes of each individual on a separate line. Genotypes are represented as 0, 1 and 2, where the number represent that number of non-reference alleles. Missing genotypes are represented by -1 (optional)

*.012

genotypes_matrix_individual{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.012.indv{:bash}

:file

Details the individuals included in the main genotypes_matrix file (optional)

*.012.indv

genotypes_matrix_position{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.012.pos{:bash}

:file

Details the site locations included in the main genotypes_matrix file (optional)

*.012.pos

impute_hap{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.impute.hap{:bash}

:file

Phased haplotypes in IMPUTE reference-panel format (optional)

*.impute.hap

impute_hap_legend{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.impute.hap.legend{:bash}

:file

Impute haplotype legend file (optional)

*.impute.hap.legend

impute_hap_indv{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.impute.hap.indv{:bash}

:file

Impute haplotype individuals file (optional)

*.impute.hap.indv

ldhat_sites{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.ldhat.sites{:bash}

:file

Output data in LDhat format, sites (optional)

*.ldhat.sites

ldhat_locs{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.ldhat.locs{:bash}

:file

output data in LDhat format, locations (optional)

*.ldhat.locs

beagle_gl{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.BEAGLE.GL{:bash}

:file

Genotype likelihoods for biallelic sites (optional)

*.BEAGLE.GL

beagle_pl{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.BEAGLE.PL{:bash}

:file

Genotype likelihoods for biallelic sites (optional)

*.BEAGLE.PL

ped{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.ped{:bash}

:file

output the genotype data in PLINK PED format (optional)

*.ped

map_{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.map{:bash}

:file

output the genotype data in PLINK PED format (optional)

*.map

tped{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.tped{:bash}

:file

output the genotype data in PLINK PED format (optional)

*.tped

tfam{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.tfam{:bash}

:file

output the genotype data in PLINK PED format (optional)

*.tfam

diff_sites_in_files{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.diff.sites_in_files{:bash}

:file

Sites that are common / unique to each file specified in optional inputs (optional)

*.diff.sites.in.files

diff_indv_in_files{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.diff.indv_in_files{:bash}

:file

Individuals that are common / unique to each file specified in optional inputs (optional)

*.diff.indv.in.files

diff_sites{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.diff.sites{:bash}

:file

Discordance on a site by site basis, specified in optional inputs (optional)

*.diff.sites

diff_indv{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.diff.indv{:bash}

:file

Discordance on a individual by individual basis, specified in optional inputs (optional)

*.diff.indv

diff_discd_matrix{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.diff.discordance.matrix{:bash}

:file

Discordance matrix between files specified in optional inputs (optional)

*.diff.discordance.matrix

diff_switch_error{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.diff.switch{:bash}

:file

Switch errors found between sites (optional)

*.diff.switch

versions{:bash}

versions.yml{:bash}

:file

File containing software versions

versions.yml

Tools

vcftools
LGPL

A set of tools written in Perl and C++ for working with VCF files. This package only contains the C++ libraries whereas the package perl-vcftools-vcf contains the perl libraries