nf-core/alleleexpression
Alleleexpression is an nf-core pipeline for allele-specific expression (ASE) analysis using STAR-WASP for alignment, UMI-tools for deduplication, and phaser for haplotype phasing and ASE detection.
Overview
Alleleexpression is a Nextflow pipeline for allele-specific expression (ASE) analysis using STAR-WASP for alignment, UMI-tools for deduplication, and phaser for haplotype phasing and ASE detection.
Features
- STAR alignment with WASP mode for allele-specific mapping
- UMI-based deduplication
- Chromosome-specific analysis with configurable chromosome selection
- Beagle phasing integration
- Phaser-based allele-specific expression analysis
- Comprehensive QC with FastQC and MultiQC reporting
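To illustrate what chromosome selection means in practice, here is a minimal Python sketch that keeps only VCF records on a chosen set of chromosomes. It is not the pipeline's implementation (the credits list bcftools, which is the more likely tool for this step); the function name `select_chromosomes` and the example records are hypothetical.

```python
from typing import Iterable, Iterator, Set


def select_chromosomes(vcf_lines: Iterable[str], chromosomes: Set[str]) -> Iterator[str]:
    """Yield header lines plus records whose CHROM column is in `chromosomes`."""
    for line in vcf_lines:
        if line.startswith("#"):
            # Keep meta-information lines and the column header unchanged.
            yield line
            continue
        chrom = line.split("\t", 1)[0]  # CHROM is the first tab-separated column
        if chrom in chromosomes:
            yield line


# Hypothetical three-record VCF used only for illustration.
example_vcf = [
    "##fileformat=VCFv4.2",
    "#CHROM\tPOS\tID\tREF\tALT\tQUAL\tFILTER\tINFO\tFORMAT\tsample1",
    "chr1\t1000\t.\tA\tG\t.\tPASS\t.\tGT\t0/1",
    "chr2\t2000\t.\tC\tT\t.\tPASS\t.\tGT\t0/1",
    "chrX\t3000\t.\tG\tA\t.\tPASS\t.\tGT\t0/1",
]

selected = list(select_chromosomes(example_vcf, {"chr1", "chr2"}))
```

Chromosome names must match the VCF's own convention exactly (for example `chr1` versus `1`), otherwise no records are selected.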
Requirements
- Nextflow (>=21.10.3)
- Singularity or Docker
- Reference genome and annotation files
- Beagle reference panel and genetic map (for phasing)
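The software requirements above can be checked before launching a run. The following is a small sketch, assuming you already have the Nextflow version string at hand (for example, read from the output of `nextflow -version`); the helper names `meets_minimum` and `available_engines` are hypothetical and not part of the pipeline.

```python
import shutil
from typing import Callable, List, Optional, Tuple

MIN_NEXTFLOW = "21.10.3"  # minimum version stated in the requirements


def parse_version(text: str) -> Tuple[int, ...]:
    """Turn '21.10.3' (or '23.09.0-edge') into a tuple of integers."""
    core = text.strip().split("-")[0]  # drop a suffix such as '-edge'
    return tuple(int(part) for part in core.split("."))


def meets_minimum(found: str, minimum: str = MIN_NEXTFLOW) -> bool:
    """Compare versions numerically, so that 21.4.x sorts before 21.10.x."""
    return parse_version(found) >= parse_version(minimum)


def available_engines(
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return the container engines found on PATH."""
    return [name for name in ("singularity", "docker") if which(name)]
```

Comparing integer tuples rather than raw strings matters here: as strings, `"21.4.3"` would sort after `"21.10.3"`, which is the wrong answer.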
Pipeline steps
- Input validation and VCF preparation
- FastQC for raw reads
- STAR alignment with WASP mode
- Filtering of WASP-passing reads
- UMI-based deduplication
- Sorting and indexing of BAM files
- Chromosome extraction from VCF
- Beagle phasing
- Phaser for haplotype-level expression
- Phaser_gene_ae for gene-level ASE
- Extraction of ASE genes
- MultiQC report generation
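The WASP filtering step can be pictured as follows. When STAR runs in WASP mode it annotates alignments that overlap variants with a `vW` tag, and `vW:i:1` marks an alignment that passed the WASP check. The sketch below filters SAM text on that tag. It is an illustration only, not the pipeline's code: the function name `wasp_passing` is hypothetical, and how reads without a `vW` tag are handled is a design choice exposed here as the `keep_untagged` flag.

```python
from typing import Iterable, Iterator


def wasp_passing(sam_lines: Iterable[str], keep_untagged: bool = False) -> Iterator[str]:
    """Yield SAM header lines and alignments tagged vW:i:1.

    Alignments with no vW tag are kept only when `keep_untagged` is True.
    """
    for line in sam_lines:
        if line.startswith("@"):
            yield line  # header lines pass through unchanged
            continue
        fields = line.rstrip("\n").split("\t")
        tags = fields[11:]  # optional fields follow the 11 mandatory columns
        vw = next((tag for tag in tags if tag.startswith("vW:i:")), None)
        if vw == "vW:i:1" or (vw is None and keep_untagged):
            yield line


# Hypothetical alignments: one passing, one failing, one without a vW tag.
example_sam = [
    "@HD\tVN:1.6\tSO:coordinate",
    "r1\t0\tchr1\t100\t255\t4M\t*\t0\t0\tACGT\tIIII\tNH:i:1\tvW:i:1",
    "r2\t0\tchr1\t200\t255\t4M\t*\t0\t0\tACGT\tIIII\tNH:i:1\tvW:i:6",
    "r3\t0\tchr1\t300\t255\t4M\t*\t0\t0\tACGT\tIIII\tNH:i:1",
]

strict = list(wasp_passing(example_sam))
lenient = list(wasp_passing(example_sam, keep_untagged=True))
```

With the strict setting only the header and `r1` survive; the lenient setting also keeps `r3`, which carries no `vW` tag.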
Output
The pipeline organizes outputs by sample name in the specified output directory:
- fastqc/: FastQC reports
- star/: STAR alignment results
- wasp/: WASP-filtered BAM files
- umi/: UMI-deduplicated BAM files
- beagle/: Phased VCF files
- phaser/: Phaser results
- ase/: Allele-specific expression results
- multiqc/: MultiQC report
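A quick way to confirm that a run produced every expected subdirectory is sketched below. It assumes the layout `<outdir>/<sample>/<subdirectory>/`, which is one reading of "organized by sample name" and is not confirmed by the text above; the function name `missing_outputs` and the sample name `sample1` are hypothetical.

```python
import tempfile
from pathlib import Path
from typing import List

# Subdirectory names taken from the output list above.
OUTPUT_SUBDIRS = ["fastqc", "star", "wasp", "umi", "beagle", "phaser", "ase", "multiqc"]


def missing_outputs(outdir: Path, sample: str) -> List[str]:
    """Return the expected subdirectories that do not exist for `sample`.

    Assumes outputs live under <outdir>/<sample>/<subdir>/ (an assumption).
    """
    sample_dir = Path(outdir) / sample
    return [name for name in OUTPUT_SUBDIRS if not (sample_dir / name).is_dir()]


# Demonstration on a temporary directory holding only two of the outputs.
with tempfile.TemporaryDirectory() as tmp:
    for name in ("fastqc", "star"):
        (Path(tmp) / "sample1" / name).mkdir(parents=True)
    still_missing = missing_outputs(Path(tmp), "sample1")
```

If the pipeline nests directories the other way round (tool first, then sample), swap the two path components in `missing_outputs`.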
Credits
- Pipeline framework: nf-core
- Tools: FastQC, STAR, UMI-tools, samtools, bcftools, Beagle, phaser, MultiQC
Contributing
Alleleexpression is under active development and we welcome contributions! If you find a bug, have an idea to improve it, or want to help implement new features (like better sex-chromosome support), feel free to open an issue or submit a pull request.
Let’s build this together.