modules/happy_happy

Hap.py is a tool to compare diploid genotypes at haplotype level. Rather than comparing VCF records row by row, hap.py will generate and match alternate sequences in a superlocus. A superlocus is a small region of the genome (sized between 1 and around 1000 bp) that contains one or more variants.

happybenchmarkhaplotypevalidation

https://github.com/nf-core/modules/[...]/modules/nf-core/happy/happy

Description

Input

name:type

description

pattern

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`query_vcf{:bash}`
`:file`

VCF/GVCF file to query

*.{gvcf,vcf}.gz

`truth_vcf{:bash}`
`:file`

gold standard VCF file

*.{gvcf,vcf}.gz

`regions_bed{:bash}`
`:file`

Sparse regions to restrict the analysis to

*.bed

`targets_bed{:bash}`
`:file`

Dense regions to restrict the analysis to

*.bed

`meta2{:bash}`
`:map`

Groovy Map containing fasta file information e.g. [ id:‘test2’]

`fasta{:bash}`
`:file`

FASTA file of the reference genome

*.{fa,fasta}

`meta3{:bash}`
`:map`

Groovy Map containing fai file information e.g. [ id:‘test3’]

`fasta_fai{:bash}`
`:file`

The index of the reference FASTA

*.fai

`meta4{:bash}`
`:map`

Groovy Map containing false_positives_bed file information e.g. [ id:‘test4’]

`false_positives_bed{:bash}`
`:file`

False positive / confident call regions. Calls outside these regions will be labelled as UNK.

*.{bed,bed.gz}

`meta5{:bash}`
`:map`

Groovy Map containing stratification_tsv file information e.g. [ id:‘test5’]

`stratification_tsv{:bash}`
`:file`

Stratification file list in TSV format

*.tsv

`meta6{:bash}`
`:map`

Groovy Map containing stratification_beds file information e.g. [ id:‘test6’]

`stratification_beds{:bash}`
`:file`

One or more BED files used for stratification (these should be referenced in the stratification TSV)

*.bed

Output

name:type

description

pattern

`summary_csv{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`*.summary.csv{:bash}`
`:file`

A CSV file containing the summary of the benchmarking

*.summary.csv

`roc_all_csv{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`*.roc.all.csv.gz{:bash}`
`:file`

A CSV file containing ROC values for all variants

*.roc.all.csv.gz

`roc_indel_locations_csv{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`*.roc.Locations.INDEL.csv.gz{:bash}`
`:file`

A CSV file containing ROC values for all indels

*.roc.Locations.INDEL.csv.gz

`roc_indel_locations_pass_csv{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`*.roc.Locations.INDEL.PASS.csv.gz{:bash}`
`:file`

A CSV file containing ROC values for all indels that passed all filters

*.roc.Locations.INDEL.PASS.csv.gz

`roc_snp_locations_csv{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`*.roc.Locations.SNP.csv.gz{:bash}`
`:file`

A CSV file containing ROC values for all SNPs

*.roc.Locations.SNP.csv.gz

`roc_snp_locations_pass_csv{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`*.roc.Locations.SNP.PASS.csv.gz{:bash}`
`:file`

A CSV file containing ROC values for all SNPs that passed all filters

*.roc.Locations.SNP.PASS.csv.gz

`extended_csv{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`*.extended.csv{:bash}`
`:file`

A CSV file containing extended info of the benchmarking

*.extended.csv

`runinfo{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`*.runinfo.json{:bash}`
`:file`

A JSON file containing the benchmarking metrics

*.metrics.json.gz

`metrics_json{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`*.metrics.json.gz{:bash}`
`:file`

A JSON file containing the run info

*.runinfo.json

`vcf{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`*.vcf.gz{:bash}`
`:file`

An annotated VCF

*.vcf.gz

`tbi{:bash}`

`meta{:bash}`
`:map`

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

`*.tbi{:bash}`
`:file`

The index of the annotated VCF

*.tbi

`versions{:bash}`

`versions.yml{:bash}`
`:file`

File containing software versions

versions.yml

Tools

happy
BSD-2-clause

Haplotype VCF comparison tools

illumina.com/products/by-type/informatics-products/basespace-sequence-hub/apps/hap-py-benchmarking github.com/Illumina/hap.py

modules/happy_happy

Description

Input

meta{:bash}:map

query_vcf{:bash}:file

truth_vcf{:bash}:file

regions_bed{:bash}:file

targets_bed{:bash}:file

meta2{:bash}:map

fasta{:bash}:file

meta3{:bash}:map

fasta_fai{:bash}:file

meta4{:bash}:map

false_positives_bed{:bash}:file

meta5{:bash}:map

stratification_tsv{:bash}:file

meta6{:bash}:map

stratification_beds{:bash}:file

Output

summary_csv{:bash}

meta{:bash}:map

*.summary.csv{:bash}:file

roc_all_csv{:bash}

meta{:bash}:map

*.roc.all.csv.gz{:bash}:file

roc_indel_locations_csv{:bash}

meta{:bash}:map

*.roc.Locations.INDEL.csv.gz{:bash}:file

roc_indel_locations_pass_csv{:bash}

meta{:bash}:map

*.roc.Locations.INDEL.PASS.csv.gz{:bash}:file

roc_snp_locations_csv{:bash}

meta{:bash}:map

*.roc.Locations.SNP.csv.gz{:bash}:file

roc_snp_locations_pass_csv{:bash}

meta{:bash}:map

*.roc.Locations.SNP.PASS.csv.gz{:bash}:file

extended_csv{:bash}

meta{:bash}:map

*.extended.csv{:bash}:file

runinfo{:bash}

meta{:bash}:map

*.runinfo.json{:bash}:file

metrics_json{:bash}

meta{:bash}:map

*.metrics.json.gz{:bash}:file

vcf{:bash}

meta{:bash}:map

*.vcf.gz{:bash}:file

tbi{:bash}

meta{:bash}:map

*.tbi{:bash}:file

versions{:bash}

versions.yml{:bash}:file

Tools

happy BSD-2-clause

included in

maintainer

get in touch

`meta{:bash}`
`:map`

`query_vcf{:bash}`
`:file`

`truth_vcf{:bash}`
`:file`

`regions_bed{:bash}`
`:file`

`targets_bed{:bash}`
`:file`

`meta2{:bash}`
`:map`

`fasta{:bash}`
`:file`

`meta3{:bash}`
`:map`

`fasta_fai{:bash}`
`:file`

`meta4{:bash}`
`:map`

`false_positives_bed{:bash}`
`:file`

`meta5{:bash}`
`:map`

`stratification_tsv{:bash}`
`:file`

`meta6{:bash}`
`:map`

`stratification_beds{:bash}`
`:file`

`summary_csv{:bash}`

`meta{:bash}`
`:map`

`*.summary.csv{:bash}`
`:file`

`roc_all_csv{:bash}`

`meta{:bash}`
`:map`

`*.roc.all.csv.gz{:bash}`
`:file`

`roc_indel_locations_csv{:bash}`

`meta{:bash}`
`:map`

`*.roc.Locations.INDEL.csv.gz{:bash}`
`:file`

`roc_indel_locations_pass_csv{:bash}`

`meta{:bash}`
`:map`

`*.roc.Locations.INDEL.PASS.csv.gz{:bash}`
`:file`

`roc_snp_locations_csv{:bash}`

`meta{:bash}`
`:map`

`*.roc.Locations.SNP.csv.gz{:bash}`
`:file`

`roc_snp_locations_pass_csv{:bash}`

`meta{:bash}`
`:map`

`*.roc.Locations.SNP.PASS.csv.gz{:bash}`
`:file`

`extended_csv{:bash}`

`meta{:bash}`
`:map`

`*.extended.csv{:bash}`
`:file`

`runinfo{:bash}`

`meta{:bash}`
`:map`

`*.runinfo.json{:bash}`
`:file`

`metrics_json{:bash}`

`meta{:bash}`
`:map`

`*.metrics.json.gz{:bash}`
`:file`

`vcf{:bash}`

`meta{:bash}`
`:map`

`*.vcf.gz{:bash}`
`:file`

`tbi{:bash}`

`meta{:bash}`
`:map`

`*.tbi{:bash}`
`:file`

`versions{:bash}`

`versions.yml{:bash}`
`:file`

happy
BSD-2-clause