nft-utils

A collection of utilities for working with nf-test


nft-utils is an nf-test plugin that provides additional functions and assertions beyond the core nf-test features. They were developed primarily by the nf-core community but should be applicable to any nf-test suite.

Start using the plugin

To start using the plugin please add it to your nf-test.config file:

nf-test.config

config {
    plugins {
        load "nft-utils@0.0.7"
    }
}

Have a look at the usage documentation for more information on how to start working with the plugin.

Use a development version

To use the development version, follow these steps:

Clone the nft-utils repository

SSH

git clone git@github.com:nf-core/nft-utils.git

HTTPS

git clone https://github.com/nf-core/nft-utils.git

Run the build script

./build.sh

Add the jar location (visible at the end of the build script output) to the nf-test.config file

nf-test.config

config {
    plugins {
        loadFromFile "full/path/to/the/plugin/jar"
    }
}

Functions usage

Snapshot functions

The plugin adds the following functions to assist with managing pipeline-level nf-test snapshots:

`removeNextflowVersion()`

nf-core pipelines create a yml file listing all the versions of the software used in the pipeline.

Here is an example of this file coming from the rnaseq pipeline.

UNTAR:
  untar: 1.34
Workflow:
  nf-core/rnaseq: v3.16.0dev
  Nextflow: 24.04.4

This function removes the Nextflow version from this yml file, as it is not relevant for the snapshot. For the purpose of the snapshot, it therefore treats the following as the contents of the YAML file:

UNTAR:
  untar: 1.34
Workflow:
  nf-core/rnaseq: v3.16.0dev

Usage:

assert snapshot(removeNextflowVersion("$outputDir/pipeline_info/nf_core_rnaseq_software_mqc_versions.yml")).match()

The function also supports wildcard patterns in file paths, which is useful when the exact filename may vary:

assert snapshot(removeNextflowVersion("$outputDir/pipeline_info/*_versions.yml")).match()

The only argument is the path to the file (or wildcard pattern) which must match a versions file in YAML format as per the nf-core standard. When using wildcards, all matching files will be processed and their results merged together.

Note: The returned YAML structure will have all keys sorted alphabetically at both the top level and within nested sections for consistent, predictable output.

`removeFromYamlMap()`

Remove any key or entire section from a YAML file. This function supports two usage patterns and also supports wildcard patterns in file paths.

Remove a specific subkey (3 arguments)

Remove a specific subkey from within a section:

removeFromYamlMap("file.yml", "Workflow", "Nextflow")

Example input:

UNTAR:
  untar: 1.34
Workflow:
  nf-core/rnaseq: v3.16.0dev
  Nextflow: 24.04.4

Result: Only the “Nextflow” subkey is removed from “Workflow”

UNTAR:
  untar: 1.34
Workflow:
  nf-core/rnaseq: v3.16.0dev

Remove an entire section (2 arguments)

Remove an entire top-level section:

removeFromYamlMap("file.yml", "Workflow")

Example input:

UNTAR:
  untar: 1.34
Workflow:
  nf-core/rnaseq: v3.16.0dev
  Nextflow: 24.04.4
Workflow2:
  some: value

Result: The entire “Workflow” section is removed

UNTAR:
  untar: 1.34
Workflow2:
  some: value

Wildcard support

Both usage patterns support wildcard patterns in the file path:

// Remove specific subkey with wildcard
removeFromYamlMap("$outputDir/pipeline_info/*_versions.yml", "Workflow", "Nextflow")
 
// Remove entire section with wildcard
removeFromYamlMap("$outputDir/pipeline_info/*_versions.yml", "Workflow")

Usage in tests

// Remove specific subkey
assert snapshot(removeFromYamlMap("$outputDir/pipeline_info/nf_core_pipeline_software_mqc_versions.yml", "Workflow", "Nextflow")).match()
 
// Remove entire section
assert snapshot(removeFromYamlMap("$outputDir/pipeline_info/nf_core_pipeline_software_mqc_versions.yml", "Workflow2")).match()
 
// Using wildcards
assert snapshot(removeFromYamlMap("$outputDir/pipeline_info/*_versions.yml", "Workflow", "Nextflow")).match()

Arguments:

First argument: Path to the YAML file (supports wildcard patterns like * and ?)
Second argument: The top-level key (section name)
Third argument (optional): The subkey to remove. If omitted, the entire section is removed.

Notes:

When using wildcard patterns, all matching files will be processed and their results merged together.
The returned YAML structure will have all keys sorted alphabetically at both the top level and within nested sections for consistent, predictable output.

`getAllFilesFromDir()`

Warning

This function requires absolute paths and does not support relative paths to params.outdir. Assign the nf-test outputDir variable to params.outdir when calling this function, as described in the nf-test docs:

  when {
    params {
      outdir = "$outputDir" // Use nf-test global variable to output dir
    }
  }

This function generates a list of all the contents within a directory (and subdirectories), additionally allowing for the inclusion or exclusion of specific files using glob patterns.

The first argument is the directory path to screen for file paths (e.g. a pipeline’s outdir).
The second argument is a boolean indicating whether to include subdirectory names in the list.
The third argument is a list of glob patterns to exclude.
The fourth argument is a file containing additional glob patterns to exclude.
The fifth argument is a list of glob patterns to include.
The sixth argument is a boolean indicating whether to output relative paths.

In the following example, a pipeline produces these files:

results/
├── pipeline_info
│   └── execution_trace_2024-09-30_13-10-16.txt
└── stable
    ├── stable_content.txt
    └── stable_name.txt
 
2 directories, 3 files

One file has stable content and a stable name (stable_content.txt), and one file has unstable content but a stable name (stable_name.txt). The last file (execution_trace_2024-09-30_13-10-16.txt) has neither stable content nor a stable name, as its name is based on the date and time of the pipeline execution.

We aim to snapshot the contents of files with stable content and the names of files and directories with stable names, while excluding the completely unstable file.

First, we will specify the following two variables that we will pass to the nf-test snapshot function:

The stable_name variable contains a list of all files and directories, excluding those matching the glob pattern pipeline_info/execution_*.{html,txt} (i.e., the unstable file).
The stable_content variable contains a list of all files, excluding those that match the two glob patterns: pipeline_info/execution_*.{html,txt} and **/stable_name.txt.
- The latter is specified in the tests/getAllFilesFromDir/.nftignore file.

def stable_name    = getAllFilesFromDir(params.outdir, true, ['pipeline_info/execution_*.{html,txt}'], null, ['*', '**/*'])
def stable_content = getAllFilesFromDir(params.outdir, false, ['pipeline_info/execution_*.{html,txt}'], 'tests/getAllFilesFromDir/.nftignore', ['*', '**/*'])

Secondly, we need to supply these two variables to the nf-test snapshot assertion. The list of files in stable_content can be supplied to the snapshot directly, and nf-test will include the md5sum hash of the file contents. For the list of stable file names with unstable contents, we can use stable_name*.name to extract just the name of every file in the list for comparison (i.e., without generating the md5sum hash).

def stable_name    = getAllFilesFromDir(params.outdir, true, ['pipeline_info/execution_*.{html,txt}'], null, ['*', '**/*'])
def stable_content = getAllFilesFromDir(params.outdir, false, ['pipeline_info/execution_*.{html,txt}'], 'tests/getAllFilesFromDir/.nftignore', ['*', '**/*'])
assert snapshot(
  stable_content,
  stable_name*.name,
).match()

getAllFilesFromDir() also supports named parameters:

def stable_name       = getAllFilesFromDir(params.outdir, ignore: ['pipeline_info/execution_*.{html,txt}'])
def stable_name_again = getAllFilesFromDir(params.outdir, include: ['stable/*'])
def stable_content    = getAllFilesFromDir(params.outdir, includeDir: false, ignore: ['pipeline_info/execution_*.{html,txt}'], ignoreFile: 'tests/getAllFilesFromDir/.nftignore')


`getRelativePath()`

Warning

This function requires absolute paths and does not support relative paths to params.outdir. Assign the nf-test outputDir variable to params.outdir when calling this function, as described in the nf-test docs:

  when {
    params {
      outdir = "$outputDir" // Use nf-test global variable to output dir
    }
  }

This function returns the paths of a list of files relative to a given directory.

results/
├── pipeline_info
│   └── execution_trace_2024-09-30_13-10-16.txt
└── stable
    ├── stable_content.txt
    └── stable_name.txt
 
2 directories, 3 files

Following the previous example, we want to get the relative path of the stable paths in the results directory.

def stable_name    = getAllFilesFromDir(params.outdir, true, ['pipeline_info/execution_*.{html,txt}'], null )

The stable_name variable contains the list of stable files and folders in the results directory.

assert snapshot(
  getRelativePath(stable_name, outputDir)
).match()

Using getRelativePath() generates the following in the snapshot:

"content": [
    [
        "pipeline_info",
        "stable",
        "stable/stable_content.txt",
        "stable/stable_name.txt"
    ]
]

A reduced list can be generated by using getAllFilesFromDir() without including the folders in the output.

"content": [
    [
        "stable/stable_content.txt",
        "stable/stable_name.txt"
    ]
]
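The reduced list above could be produced by a call along these lines. This is a minimal sketch that reuses the positional arguments described earlier; the variable name stable_files is only illustrative.

```groovy
// Second argument `false`: leave directory names out of the returned list
// (stable_files is a hypothetical variable name used for illustration)
def stable_files = getAllFilesFromDir(params.outdir, false, ['pipeline_info/execution_*.{html,txt}'], null)

// Snapshot the paths relative to the output directory instead of the bare file names
assert snapshot(
  getRelativePath(stable_files, outputDir)
).match()
```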

Without using getRelativePath() and by using *.name to capture the file names, only a flat structure would be generated, as shown below:

"content": [
    [
        "pipeline_info",
        "stable",
        "stable_content.txt",
        "stable_name.txt"
    ]
]

The relative named parameter of getAllFilesFromDir() can also be used to combine the two functions:

def stable_name       = getAllFilesFromDir(params.outdir, relative: true, ignore: ['pipeline_info/execution_*.{html,txt}'] )
def stable_name_again = getAllFilesFromDir(params.outdir, relative: true, include: ['stable/*'] )

`listToMD5()`

This function takes a list of values as input and converts the sequence to an MD5 hash. All values in the list should be of a type that can be converted to a string, otherwise the function will fail.

A common use case for this function is to read a file, remove all unstable lines from it and regenerate an MD5 hash.
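The use case above could look like the following sketch. The file path and the "Date:" prefix are assumptions made up for illustration, so substitute a file your pipeline actually writes and a filter that matches its unstable lines. The sketch also assumes the value returned by listToMD5() can be passed straight to snapshot().

```groovy
then {
    // Hypothetical output file; replace with a file produced by your pipeline
    def report_lines = new File("$outputDir/report/summary.txt").readLines()

    // Drop the lines assumed to change between runs (here: lines starting with "Date:")
    def stable_lines = report_lines.findAll { line -> !line.startsWith("Date:") }

    // Hash the remaining lines and snapshot the resulting MD5 value
    assert snapshot(listToMD5(stable_lines)).match()
}
```

Filtering before hashing keeps the snapshot to a single short value while still detecting any change in the stable part of the file.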

`filterNextflowOutput()`

This function filters Nextflow stdout/stderr output to remove variable content that makes snapshots unstable. It censors common patterns such as timestamps, run names and runtime-specific information, and it removes other recurring noise such as empty lines and the Nextflow message announcing that a new version is available, so that test snapshots are reproducible.

The function can be called with multiple parameters:

// Basic usage - works directly with workflow.stdout and workflow.stderr, or even both
def filtered_stdout = filterNextflowOutput(workflow.stdout)
def filtered_stderr = filterNextflowOutput(workflow.stderr)
def filtered_both = filterNextflowOutput(workflow.stdout + workflow.stderr)
 
// Keep ANSI escape codes (they are stripped by default)
def filtered_with_ansi_kept = filterNextflowOutput(workflow.stdout + workflow.stderr, keepAnsi: true)
 
// Ignore lines containing specific strings
def filtered_with_ignore = filterNextflowOutput(workflow.stdout, ignore: ["Submitted process"])
 
// Include lines containing specific strings
def filtered_with_include = filterNextflowOutput(workflow.stdout, include: ["Submitted process"])

Once censored, the following kinds of lines are sorted alphabetically:

Staging foreign file messages (file staging operations)
Submitted process messages (process submissions)
Check * file for details messages (error/log references)
WARN: messages (warning logs)
ERROR: messages (error logs)

This behaviour can be disabled by setting sorted: false. This is not recommended, as it will cause the snapshot to fail whenever the order of these lines differs between runs.

Other lines are kept in their original order:

Other log messages (INFO, etc.)
Execution output and results
All other content

For example, process submissions like

[57/0d391c] Submitted process > FASTQC (sample_2)
[6f/3be732] Submitted process > FASTQC (sample_1)
[6d/0082ab] Submitted process > FASTQC (sample_3)

will be consistently ordered as

[PROCESS_HASH] Submitted process > FASTQC (sample_1)
[PROCESS_HASH] Submitted process > FASTQC (sample_2)
[PROCESS_HASH] Submitted process > FASTQC (sample_3)

ANSI escape codes (colors, formatting) are stripped by default to ensure clean, consistent snapshots. This prevents color codes from appearing as garbled text like \u001B[32mtext\u001B[0m or being misinterpreted by other filtering patterns. You can disable this by setting keepAnsi: true if you need to preserve formatting codes.

Common patterns that are automatically filtered include:

Empty lines
- Blank lines and whitespace-only lines are removed entirely
Timestamps
- Various formats (ISO 8601, log timestamps, etc.) are replaced with [TIMESTAMP]
Process hashes
- Nextflow process hashes are replaced with [PROCESS_HASH]
File paths
- Absolute paths to scripts and logs are replaced with [PATH]
- The following common environment variables are checked and, if available, replaced with [PATH]
  - HOME, NFT_WORKDIR, NXF_CACHE_DIR, NXF_CONDA_CACHEDIR, NXF_HOME, NXF_SINGULARITY_CACHEDIR, NXF_SINGULARITY_LIBRARYDIR, NXF_TEMP, NXF_WORK
Version information
- “Nextflow X.Y.Z is available” messages are removed
- “N E X T F L O W ~ version 24.04.5” is replaced with “N E X T F L O W ~ version [VERSION]”
- “nf-core/pipeline 1.2.3” is replaced with “nf-core/pipeline [VERSION]”

Example usage in a test:

test("my_pipeline_test") {
 
    when {
        params {
            outdir = "$outputDir"
        }
    }
 
    then {
        assert snapshot(
            filterNextflowOutput(workflow.stdout + workflow.stderr)
        ).match()
    }
}

Dependency management

The plugin also adds the following functions to manage the dependencies of tests on nf-core components, in situations where they may not otherwise be available (for example, writing tests for cross-organisational subworkflows in non-nf-core repositories).

`nfcoreInitialise()` - set up a temporary nf-core library

In a setup block, use the nfcoreInitialise() function to initialise a temporary nf-core library to install modules into. This function takes the path to the location to set up the library as an argument. It is recommended to use a location inside launchDir as this will initialise a test-specific library.

setup {
    nfcoreInitialise("${launchDir}/library")
}

`nfcoreInstall()` - Install modules to a temporary library

Use the nfcoreInstall() function to install nf-core modules in a temporary library. This function takes the path to the library and either a list of strings, each with an nf-core module name in tool/subtool format, or a list of maps, with the keys name, sha, and remote (both sha and remote are optional).

setup {
    nfcoreInitialise("${launchDir}/library")
    nfcoreInstall("${launchDir}/library", ["minimap2/index"])
    nfcoreInstall(
        "${launchDir}/library",
        [
            [
                name: "minimap2/align",
                sha: "5850432aab24a1924389b660adfee3809d3e60a9"
            ],
            [
                name: "fastqc",
                remote: "https://github.com/nf-core-test/modules.git"
            ],
            [
                name: "prokka",
                sha: "9627f4367b11527194ef14473019d0e1a181b741",
                remote: "https://github.com/nf-core-test/modules.git"
            ],
        ]
    )
}

`nfcoreLink()` - Link a temporary library to your modules directory

Use the nfcoreLink() function to link a library to your module library. This function takes two arguments, the path to a temporary library, and the location where the modules in the library should be temporarily linked (e.g. ${baseDir}/modules/nf-core):

setup {
    nfcoreInitialise("${launchDir}/library")
    nfcoreInstall("${launchDir}/library", ["minimap2/index", "minimap2/align"])
    nfcoreLink("${launchDir}/library", "${baseDir}/modules/")
}

This creates a symlink to the modules directory of your temporary library at ${baseDir}/modules/nf-core. Using this location, you can refer to the nf-core modules as if they were installed as normal in your tests.

`nfcoreUnlink()` - Unlink a temporary library from your modules directory

To unlink a temporary library after the test has completed, use the nfcoreUnlink() function. It takes the same arguments as nfcoreLink(), and recursively removes all symlinks pointing to the temporary library.

setup {
    nfcoreInitialise("${launchDir}/library")
    nfcoreInstall("${launchDir}/library", ["minimap2/index", "minimap2/align"])
    nfcoreLink("${launchDir}/library", "${baseDir}/modules/")
 
    run("MINIMAP2_INDEX") {
        script "${baseDir}/modules/nf-core/minimap2/index/main.nf"
        ...
    }
}
 
when {
    ...
}
 
then {
    ...
}
 
cleanup {
  nfcoreUnlink("${launchDir}/library", "${baseDir}/modules/")
}

`nfcoreDeleteLibrary()` - Completely delete a temporary library

You can use the nfcoreDeleteLibrary() function to completely remove the temporary library, if desired.

 
setup {
    nfcoreInitialise("${launchDir}/library")
    nfcoreInstall("${launchDir}/library", ["minimap2/index", "minimap2/align"])
    nfcoreLink("${launchDir}/library", "${baseDir}/modules/")
 
    run("MINIMAP2_INDEX") {
        script "${baseDir}/modules/nf-core/minimap2/index/main.nf"
        ...
    }
}
 
when {
    ...
}
 
then {
    ...
}
 
cleanup {
    nfcoreDeleteLibrary("${launchDir}/library")
}

`sanitizeOutput()` - Sanitize process output to create clean snapshots

The sanitizeOutput() function cleans process and workflow outputs by removing the numbered keys. This produces snapshots that are easier for humans to read.

then {
  assert snapshot(sanitizeOutput(process.out)).match()
}

The function also supports options to control its behaviour:

unstableKeys: A list of keys to treat as unstable and only snapshot the file name (and not the md5sum). This is useful for output entries that contain files with unstable content.

then {
  assert snapshot(sanitizeOutput(process.out, unstableKeys:["zip"])).match()
}
