Before you start
So, you want to add a new pipeline to nf-core - brilliant! Before you start typing, check that you’re happy with the following points:
- You’re familiar with nf-core and Nextflow (see our introduction docs).
- You’re used to working with
git
and GitHub (see a nice tutorial here) - The workflow you’re thinking of meets the nf-core guidelines.
The main steps involved in adding a new nf-core pipeline covered below are:
- Joining the community
- Contribution overview
- Testing
- Patching bugs
- Getting help
- Pipeline contribution conventions
Join the community
At its heart, nf-core is a community. To contribute to a pipeline, you should be part of that community! Please request to join the nf-core GitHub organisation) and join the nf-core Slack.
Each nf-core pipeline has its own Slack channel that can be found by searching the channel list.
Contribution overview
It’s good to introduce your idea early on so that it can be discussed, before you spend lots of time coding.
If you’d like to write some code for an nf-core pipeline, the standard workflow is as follows:
- Check that there isn’t already an issue about your idea for that pipeline to avoid duplicating work. If there isn’t one already, please create one so that others know you’re working on this
- Fork the pipeline repository to your GitHub account
- Make the necessary changes / additions on
dev
branch by using git checkout within your forked repository following pipeline conventions - Use
nf-core pipelines schema build
and add any new parameters to the pipeline JSON schema (requires nf-core tools >= 1.10). - Submit a Pull Request against the
dev
branch and wait for the code to be reviewed and merged. See guidelines for this.
If you’re new to working with git, you can view the GitHub pull requests documentation or other excellent git
resources to get started.
Testing
You can optionally test your changes by running the pipeline locally. Then it is recommended to use the debug
profile to
receive warnings about process selectors and other debug info. Example: nextflow run . -profile debug,test,docker --outdir <OUTDIR>
.
When you create a pull request with changes, GitHub Actions will run automatic tests. Typically, pull-requests are only fully reviewed when these tests are passing, though of course, we can help out before then.
There are typically two types of tests that run:
Lint tests
nf-core
has a set of guidelines which all pipelines must adhere to.
To enforce these and ensure that all pipelines stay in sync, we have developed a helper tool that runs checks on the pipeline code. This is in the nf-core/tools repository and once installed can be run locally with the nf-core pipelines lint <pipeline-directory>
command.
If any failures or warnings are encountered, please follow the listed URL for more documentation.
Pipeline tests
Each nf-core
pipeline should be set up with a minimal set of test data.
GitHub Actions
then runs the pipeline on this data to ensure that it exits successfully.
If there are any failures then the automated tests fail.
These tests are run both with the latest available version of Nextflow
and also the minimum required version that is stated in the pipeline code.
Patching bugs
Only in the unlikely and regretful event of a release happening with a bug.
- On your own fork, make a new branch
patch
based onupstream/master
. - Fix the bug, and bump version (X.Y.Z+1).
- A PR should be made on
master
from patch to directly resolve this particular bug.
Getting help
For further information/help, please consult the usage documentation for the particular pipeline and don’t hesitate to get in touch on the nf-core Slack #<pipeline-name>
channel. If you are not already a member you can (join our Slack here).
Pipeline contribution conventions
To make the nf-core pipeline code and processing logic more understandable for new contributors and to ensure quality, we semi-standardize the way the code and other contributions are written.
Adding a new step
If you wish to contribute a new step, please use the following coding standards:
- Define the corresponding input channel into your new process from the expected previous process channel.
- Write the process block (see below).
- Define the output channel if needed (see below).
- Add any new parameters to
nextflow.config
with a default (see below). - Add any new parameters to
nextflow_schema.json
with help text (via thenf-core pipelines schema build
tool). - Add sanity checks and validation for all relevant parameters.
- Perform local tests to validate that the new code works as expected.
- If applicable, add a new test command in
.github/workflow/ci.yml
. - Update MultiQC config
assets/multiqc_config.yml
so relevant suffixes, file name clean up and module plots are in the appropriate order. If applicable, add a MultiQC module. - Add a description of the output files and if relevant any appropriate images from the MultiQC report to
docs/output.md
.
Default values
Parameters should be initialised / defined with default values in nextflow.config
under the params
scope.
Once there, use nf-core pipelines schema build
to add to nextflow_schema.json
.
Default processes resource requirements
Sensible defaults for process resource requirements (CPUs / memory / time) for a process should be defined in conf/base.config
. These should generally be specified generic with withLabel:
selectors so they can be shared across multiple processes/steps of the pipeline. An nf-core standard set of labels that should be followed where possible can be seen in the nf-core pipeline template, which has the default process as a single core-process, and then different levels of multi-core configurations for increasingly large memory requirements defined with standardised labels.
The process resources can be passed on to the tool dynamically within the process with the ${task.cpu}
and ${task.memory}
variables in the script:
block.
Naming schemes
Please use the following naming schemes, to make it easy to understand what is going where.
- initial process channel:
ch_output_from_<process>
- intermediate and terminal channels:
ch_<previousprocess>_for_<nextprocess>
Nextflow version bumping
If you are using a new feature from core Nextflow, you may bump the minimum required version of Nextflow in the pipeline with: nf-core pipelines bump-version --nextflow . [min-nf-version]
Images and figures
For overview images and other documents, we follow the nf-core style guidelines and examples.