Description

Select sequences from a large file based on name/ID

Input

name:type
description
pattern

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

sequence{:bash}

:file

Fasta or fastq file containing sequences to be filtered

*.{fa,fna,faa,fasta,fq,fastq}[.gz]

pattern{:bash}

:file

pattern file (one record per line). If no pattern is given, a string can be specified within the args using ‘-p pattern_string’

*.{txt,tsv}

Output

name:type
description
pattern

filter{:bash}

meta{:bash}

:map

Groovy Map containing sample information e.g. [ id:‘test’, single_end:false ]

*.{fa,fq,fa.gz,fq.gz}{:bash}

:file

Fasta or fastq file containing the filtered sequences. Will be gzipped if the input file was gzipped.

versions_seqkit{:bash}

${task.process}{:bash}

:string

The name of the process

seqkit{:bash}

:string

The name of the tool

seqkit version | sed "s/seqkit v//"{:bash}

:eval

The expression to obtain the version of the tool

Topics

name:type
description
pattern

versions{:bash}

${task.process}{:bash}

:string

The name of the process

seqkit{:bash}

:string

The name of the tool

seqkit version | sed "s/seqkit v//"{:bash}

:eval

The expression to obtain the version of the tool

Tools

seqkit
MIT

Cross-platform and ultrafast toolkit for FASTA/Q file manipulation, written by Wei Shen.