Description

Runs PEKA CLIP peak k-mer analysis

Input

name:type
description
pattern

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

peaks:file

BED file of peak regions

*.{bed,bed.gz}

meta:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

BED file of crosslinks

*.{bed,bed.gz}

fasta:file

Genome reference sequence used

*.{fa,fasta}

fai:file

FAI file corresponding to the reference sequence

*.{fai}

gtf:file

A segmented GTF used to annotate peaks

*.{gtf}

Output

name:type
description
pattern

cluster

meta:file

TSV file of summed occurrence distributions of k-mers within defined clusters

*.tsv

*mer_cluster_distribution*:file

TSV file of summed occurrence distributions of k-mers within defined clusters

*.tsv

distribution

meta:file

TSV file with calculated PEKA score and occurrence distribution for all possible k-mers

*.tsv

*mer_distribution*:file

TSV file with calculated PEKA score and occurrence distribution for all possible k-mers

*.tsv

rtxn

meta:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

*rtxn*:file

rtxn file

*rtxn*

pdf

meta:file

PDF file with graphs showing k-mer occurrence distributions around thresholded crosslink sites

*.pdf

*.pdf:file

PDF file with graphs showing k-mer occurrence distributions around thresholded crosslink sites

*.pdf

tsites

meta:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

*thresholded_sites*.bed.gz:file

BED file of thresholded sites

*thresholded_sites*.bed.gz

oxn

meta:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

*oxn*.bed.gz:file

BED file of oxn sites

*oxn*.bed.gz

clust

meta:map

Groovy Map containing sample information e.g. [ id:'sample1', single_end:false ]

*_clusters.csv:file

CSV file of clusters

*_clusters.csv

versions

versions.yml:file

File containing software versions

versions.yml

Tools

peka
GPL v3

Positionally-enriched k-mer analysis (PEKA) is a software package for identifying enriched protein-RNA binding motifs from CLIP datasets