Description

Tool for imputation and phasing from vcf file or directly from bam files.

Input

name:type
description
pattern

meta:map

Groovy Map containing sample information e.g. [ id:'test', single_end:false ]

input:file

Either one or multiple BAM/CRAM files in an array containing low-coverage sequencing reads or one VCF/BCF file containing the genotype likelihoods. When using BAM/CRAM the name of the file is used as samples name.

*.{bam,cram,vcf,vcf.gz,bcf,bcf.gz}

input_index:file

Index file of the input BAM/CRAM/VCF/BCF file.

*.{bam.bai,cram.crai,vcf.gz.csi,bcf.gz.csi}

bamlist:file

File containing the list of BAM/CRAM files to be phased. One file per line and a second column can be added to indicate the sample name.

*.{txt,tsv}

samples_file:file

File with sample names and ploidy information. One sample per line with a mandatory second column indicating ploidy (1 or 2). Sample names that are not present are assumed to have ploidy 2 (diploids). GLIMPSE does NOT handle the use of sex (M/F) instead of ploidy.

*.{txt,tsv}

input_region:string

Target region used for imputation, including left and right buffers (e.g. chr20:1000000-2000000). Optional if reference panel is in bin format.

chrXX:leftBufferPosition-rightBufferPosition

output_region:string

Target imputed region, excluding left and right buffers (e.g. chr20:1000000-2000000). Optional if reference panel is in bin format.

chrXX:leftBufferPosition-rightBufferPosition

reference:file

Reference panel of haplotypes in VCF/BCF format.

*.{vcf.gz,bcf.gz}

reference_index:file

Index file of the Reference panel file.

*.{vcf.gz.csi,bcf.gz.csi}

map:file

File containing the genetic map. Optional if reference panel is in bin format.

*.gmap

meta2:map

Groovy Map containing genomic map information e.g. [ map:'GRCh38' ]

fasta_reference:file

Faidx-indexed reference sequence file in the appropriate genome build. Necessary for CRAM files.

*.fasta

fasta_reference_index:file

Faidx index of the reference sequence file in the appropriate genome build. Necessary for CRAM files.

*.fai

Output

name:type
description
pattern

phased_variants

meta:map

Groovy Map containing sample information e.g. [ id:'test', single_end:false ]

*.{vcf,vcf.gz,bcf,bgen}:file

Output VCF/BCF file containing genotype probabilities (GP field), imputed dosages (DS field), best guess genotypes (GT field), sampled haplotypes in the last (max 16) main iterations (HS field) and info-score.

*.{vcf,bcf,vcf.gz,bcf.gz}

stats_coverage

meta:map

Groovy Map containing sample information e.g. [ id:'test', single_end:false ]

*.txt.gz:file

Optional coverage statistic file created when BAM/CRAM files are used as inputs.

*.txt.gz

versions

versions.yml:file

File containing software versions

versions.yml

Tools

glimpse2
MIT

GLIMPSE2 is a phasing and imputation method for large-scale low-coverage sequencing studies.