A set of tools written in Perl and C++ for working with VCF files
Input
name:type
description
pattern
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
variant_file{:bash}
:file
variant input file which can be vcf, vcf.gz, or bcf format.
bed{:bash}
:file
bed file which can be used with different arguments in vcftools (optional)
diff_variant_file{:bash}
:file
secondary variant file which can be used with the ‘diff’ suite of tools (optional)
Output
name:type
description
pattern
vcf{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.vcf{:bash}
:file
vcf file (optional)
*.vcf
bcf{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.bcf{:bash}
:file
bcf file (optional)
*.bcf
frq{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.frq{:bash}
:file
Allele frequency for each site (optional)
*.frq
frq_count{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.frq.count{:bash}
:file
Allele counts for each site (optional)
*.frq.count
idepth{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.idepth{:bash}
:file
mean depth per individual (optional)
*.idepth
ldepth{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.ldepth{:bash}
:file
depth per site summed across individuals (optional)
*.ildepth
ldepth_mean{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.ldepth.mean{:bash}
:file
mean depth per site calculated across individuals (optional)
*.ldepth.mean
gdepth{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.gdepth{:bash}
:file
depth for each genotype in vcf file (optional)
*.gdepth
hap_ld{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.hap.ld{:bash}
:file
r2, D, and D’ statistics using phased haplotypes (optional)
*.hap.ld
geno_ld{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.geno.ld{:bash}
:file
squared correlation coefficient between genotypes encoded as 0, 1 and 2 to represent the number of non-reference alleles in each individual (optional)
*.geno.ld
geno_chisq{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.geno.chisq{:bash}
:file
test for genotype independence via the chi-squared statistic (optional)
*.geno.chisq
list_hap_ld{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.list.hap.ld{:bash}
:file
r2 statistics of the sites contained in the provided input file verses all other sites (optional)
*.list.hap.ld
list_geno_ld{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.list.geno.ld{:bash}
:file
r2 statistics of the sites contained in the provided input file verses all other sites (optional)
*.list.geno.ld
interchrom_hap_ld{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.interchrom.hap.ld{:bash}
:file
r2 statistics for sites (haplotypes) on different chromosomes (optional)
*.interchrom.hap.ld
interchrom_geno_ld{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.interchrom.geno.ld{:bash}
:file
r2 statistics for sites (genotypes) on different chromosomes (optional)
*.interchrom.geno.ld
tstv{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.TsTv{:bash}
:file
Transition / Transversion ratio in bins of size defined in options (optional)
*.TsTv
tstv_summary{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.TsTv.summary{:bash}
:file
Summary of all Transitions and Transversions (optional)
*.TsTv.summary
tstv_count{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.TsTv.count{:bash}
:file
Transition / Transversion ratio as a function of alternative allele count (optional)
*.TsTv.count
tstv_qual{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.TsTv.qual{:bash}
:file
Transition / Transversion ratio as a function of SNP quality threshold (optional)
*.TsTv.qual
filter_summary{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.FILTER.summary{:bash}
:file
Summary of the number of SNPs and Ts/Tv ratio for each FILTER category (optional)
*.FILTER.summary
sites_pi{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.sites.pi{:bash}
:file
Nucleotide divergency on a per-site basis (optional)
*.sites.pi
windowed_pi{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.windowed.pi{:bash}
:file
Nucleotide diversity in windows, with window size determined by options (optional)
*windowed.pi
weir_fst{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.weir.fst{:bash}
:file
Fst estimate from Weir and Cockerham’s 1984 paper (optional)
*.weir.fst
heterozygosity{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.het{:bash}
:file
Heterozygosity on a per-individual basis (optional)
*.het
hwe{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.hwe{:bash}
:file
Contains the Observed numbers of Homozygotes and Heterozygotes and the corresponding Expected numbers under HWE (optional)
*.hwe
tajima_d{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.Tajima.D{:bash}
:file
Tajima’s D statistic in bins with size of the specified number in options (optional)
*.Tajima.D
freq_burden{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.ifreqburden{:bash}
:file
Number of variants within each individual of a specific frequency in options (optional)
*.ifreqburden
lroh{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.LROH{:bash}
:file
Long Runs of Homozygosity (optional)
*.LROH
relatedness{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.relatedness{:bash}
:file
Relatedness statistic based on the method of Yang et al, Nature Genetics 2010 (doi:10.1038/ng.608) (optional)
*.relatedness
relatedness2{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.relatedness2{:bash}
:file
Relatedness statistic based on the method of Manichaikul et al., BIOINFORMATICS 2010 (doi:10.1093/bioinformatics/btq559) (optional)
*.relatedness2
lqual{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.lqual{:bash}
:file
per-site SNP quality (optional)
*.lqual
missing_individual{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.imiss{:bash}
:file
Missingness on a per-individual basis (optional)
*.imiss
missing_site{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.lmiss{:bash}
:file
Missingness on a per-site basis (optional)
*.lmiss
snp_density{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.snpden{:bash}
:file
Number and density of SNPs in bins of size defined by option (optional)
*.snpden
kept_sites{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.kept.sites{:bash}
:file
All sites that have been kept after filtering (optional)
*.kept.sites
removed_sites{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.removed.sites{:bash}
:file
All sites that have been removed after filtering (optional)
*.removed.sites
singeltons{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.singletons{:bash}
:file
Location of singletons, and the individual they occur in (optional)
*.singeltons
indel_hist{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.indel.hist{:bash}
:file
Histogram file of the length of all indels (including SNPs) (optional)
*.indel_hist
hapcount{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.hapcount{:bash}
:file
Unique haplotypes within user specified bins (optional)
*.hapcount
mendel{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.mendel{:bash}
:file
Mendel errors identified in trios (optional)
*.mendel
format{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.FORMAT{:bash}
:file
Extracted information from the genotype fields in the VCF file relating to a specified FORMAT identifier (optional)
*.FORMAT
info{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.INFO{:bash}
:file
Extracted information from the INFO field in the VCF file (optional)
*.INFO
genotypes_matrix{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.012{:bash}
:file
Genotypes output as large matrix.
Genotypes of each individual on a separate line.
Genotypes are represented as 0, 1 and 2, where the number represent that number of non-reference alleles.
Missing genotypes are represented by -1 (optional)
*.012
genotypes_matrix_individual{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.012.indv{:bash}
:file
Details the individuals included in the main genotypes_matrix file (optional)
*.012.indv
genotypes_matrix_position{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.012.pos{:bash}
:file
Details the site locations included in the main genotypes_matrix file (optional)
*.012.pos
impute_hap{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.impute.hap{:bash}
:file
Phased haplotypes in IMPUTE reference-panel format (optional)
*.impute.hap
impute_hap_legend{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.impute.hap.legend{:bash}
:file
Impute haplotype legend file (optional)
*.impute.hap.legend
impute_hap_indv{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.impute.hap.indv{:bash}
:file
Impute haplotype individuals file (optional)
*.impute.hap.indv
ldhat_sites{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.ldhat.sites{:bash}
:file
Output data in LDhat format, sites (optional)
*.ldhat.sites
ldhat_locs{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.ldhat.locs{:bash}
:file
output data in LDhat format, locations (optional)
*.ldhat.locs
beagle_gl{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.BEAGLE.GL{:bash}
:file
Genotype likelihoods for biallelic sites (optional)
*.BEAGLE.GL
beagle_pl{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.BEAGLE.PL{:bash}
:file
Genotype likelihoods for biallelic sites (optional)
*.BEAGLE.PL
ped{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.ped{:bash}
:file
output the genotype data in PLINK PED format (optional)
*.ped
map_{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.map{:bash}
:file
output the genotype data in PLINK PED format (optional)
*.map
tped{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.tped{:bash}
:file
output the genotype data in PLINK PED format (optional)
*.tped
tfam{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.tfam{:bash}
:file
output the genotype data in PLINK PED format (optional)
*.tfam
diff_sites_in_files{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.diff.sites_in_files{:bash}
:file
Sites that are common / unique to each file specified in optional inputs (optional)
*.diff.sites.in.files
diff_indv_in_files{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.diff.indv_in_files{:bash}
:file
Individuals that are common / unique to each file specified in optional inputs (optional)
*.diff.indv.in.files
diff_sites{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.diff.sites{:bash}
:file
Discordance on a site by site basis, specified in optional inputs (optional)
*.diff.sites
diff_indv{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.diff.indv{:bash}
:file
Discordance on a individual by individual basis, specified in optional inputs (optional)
*.diff.indv
diff_discd_matrix{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.diff.discordance.matrix{:bash}
:file
Discordance matrix between files specified in optional inputs (optional)
*.diff.discordance.matrix
diff_switch_error{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.diff.switch{:bash}
:file
Switch errors found between sites (optional)
*.diff.switch
versions{:bash}
versions.yml{:bash}
:file
File containing software versions
versions.yml
Tools
vcftools
LGPL
A set of tools written in Perl and C++ for working with VCF files. This package only contains the C++ libraries whereas the package perl-vcftools-vcf contains the perl libraries