Filter, sort and markdup sam/bam files, with optional BQSR and variant calling.
Input
name:type
description
pattern
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
bam{:bash}
:file
Input SAM/BAM file
*.{bam,sam}
bai{:bash}
:file
Input BAM file index
*.bai
target_regions_bed{:bash}
:file
Optional BED file containing target regions for BQSR and variant calling.
*.bed
filter_regions_bed{:bash}
:file
Optional BED file containing regions to filter.
*.bed
intermediate_bqsr_tables{:bash}
:file
Optional list of BQSR tables, used when parsing files created by elprep split
*.table
recall_file{:bash}
:file
Recall file with intermediate results for bqsr
*.recall
meta2{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
reference_sequences{:bash}
:file
Optional SAM header to replace existing header.
*.sam
meta3{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
reference_elfasta{:bash}
:file
Elfasta file, required for BQSR and variant calling.
*.elfasta
meta4{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
known_sites_elsites{:bash}
:file
Optional elsites file containing known SNPs for BQSR.
*.elsites
run_haplotypecaller{:bash}
:boolean
Run variant calling on the input files. Needed to generate gvcf output.
run_bqsr{:bash}
:boolean
Run BQSR on the input files. Needed to generate recall metrics.
bqsr_tables_only{:bash}
:boolean
Write intermediate BQSR tables, used when parsing files created by elprep split.
get_activity_profile{:bash}
:boolean
Get the activity profile calculated by the haplotypecaller to the given file in IGV format.
get_assembly_regions{:bash}
:boolean
Get the assembly regions calculated by haplotypecaller to the specified file in IGV format.
Output
name:type
description
pattern
bam{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.{bam,sam}{:bash}
:file
BAM/SAM file
*.{bam,sam}
logs{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.log{:bash}
:file
Log file
elprep-*.log
metrics{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.metrics.txt{:bash}
:file
Metrics file
*.{metrics.txt}
recall{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.recall{:bash}
:file
Recall file
*.{recall}
gvcf{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.vcf.gz{:bash}
:file
VCF file
*.{vcf.gz}
table{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.table{:bash}
:file
Table
*.{table}
activity_profile{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.activity_profile.igv{:bash}
:file
Activity profile
*.{activity_profile.igv}
assembly_regions{:bash}
meta{:bash}
:map
Groovy Map containing sample information
e.g. [ id:‘test’, single_end:false ]
*.assembly_regions.igv{:bash}
:file
Assembly regions
*.{assembly_regions.igv}
versions{:bash}
versions.yml{:bash}
:file
File containing software versions
versions.yml
Tools
elprep
AGPL v3
elPrep is a high-performance tool for preparing .sam/.bam files for variant calling in sequencing pipelines. It can be used as a drop-in replacement for SAMtools/Picard/GATK4.