Description

Perform quantification with Salmon or Kallisto to produce count tables and SummarizedExperiment objects

Input

name:type
description
pattern

meta :map

Groovy Map containing study-level sample sheet information. e.g. [
id:‘SRP1234’ ].

samplesheet :file

Sample sheet, to be baked into the colData of summarizedexperiment
objects.

*.{csv,tsv}

reads :file

Channel with input FastQ files of size 1 and 2 for single-end and
paired-end data, respectively. OR a transcriptome-level BAM file if
running Salmon in alignment mode.

index :path

Path to Salmon or Kallisto index in the tool-appropriate form.

gtf :file

Channel with features in GTF format. Passed to pseudoaligners and used
to generate transcript/ gene mappings.

gtf_id_attribute :string

Attribute in GTF file corresponding to the gene identifier.

gtf_extra_attribute :string

GTF alternative gene attribute (e.g. gene_name)

pseudo_aligner :string

Pseudoaligner, kallisto or salmon.

alignment_mode :boolean

If running Salmon, run in alignment mode (true or false).

lib_type :string

String to override Salmon library type.

kallisto_quant_fraglen :integer

Estimated fragment length. Required if running Kallisto with
single-ended reads.

kallisto_quant_fraglen_sd :integer

Estimated standard error for fragment length required by Kallisto in
single-end mode.

Output

name:type
description
pattern

meta :map

Groovy Map containing study-level sample sheet information. e.g. [
id:‘SRP1234’ ].

results :file

Channel containing sample-wise results directories from the
pseudoaligner.

multiqc :file

Channel containing those pseudoaligner outputs readable by MultiQC for
passing to workflow-level reporting.

tpm_gene :file

Gene-level matrix of abundance values in TPM.

*.gene_tpm.tsv

counts_gene :file

Gene-level matrix of unadjusted estimated counts from tximport
(countsFromAbundance = 'no').

*.gene_counts.tsv

lengths_gene :file

Gene-level matrix of length values for modelling in downstream
analysis.

gene_lengths.tsv

counts_gene_length_scaled :file

Gene-level matrix of estimated counts, generated from abundance (TPM)
values by scaling to library size, additionally scaled using the
average transcript length, averaged over samples and to library size,
using tximport countsFromAbundance = 'lengthScaledTPM'.

*.gene_counts_length_scaled.tsv

counts_gene_scaled :file

Gene-level matrix of estimated counts, generated from abundance (TPM)
values by scaling to library size with tximport countsFromAbundance = 'scaledTPM'.

*.gene_counts_length_scaled.tsv

tpm_transcript :file

Transcript-level matrix of abundance values in TPM.

*.transcript_tpm.tsv

counts_transcript :file

Transcript-level matrix of unadjusted estimated counts from tximport
(countsFromAbundance = 'no').

*.transcript_counts.tsv

lengths_transcript :file

Transcript-level matrix of length values for modelling in downstream
analysis.

transcript_lengths.tsv

merged_gene_rds :file

Serialised SummarizedExperiment object containing gene level TPM
abundance values and counts generated from tximport with
countsFromAbundance = 'no'.

*.rds

merged_gene_rds_length_scaled :file

Serialised SummarizedExperiment object containing gene level TPM
abundance values and counts generated from tximport with
countsFromAbundance='lengthScaledTPM'.

*.rds

merged_gene_rds_scaled :file

Serialised SummarizedExperiment object containing gene level TPM
abundance values and counts generated from tximport with
countsFromAbundance='scaledTPM'.

*.rds

merged_transcript_rds :file

Serialised SummarizedExperiment object containing transcript level TPM
abundance values and counts generated from tximport with
countsFromAbundance = 'no'.

*.rds

versions :file

File containing software versions
Structure: [ path(versions.yml) ]

versions.yml