Manual debugging on clusters using schedulers

In some cases, when testing configuration files on clusters that use a scheduler, you may get failed jobs whose errors are ‘uninformative’ and give little indication of the actual cause.

In such cases, a good way of debugging the failed job is to change to the working directory of the failed process (which Nextflow should report) and try to submit the job manually.

You can do this by taking the .command.run script found in the working directory (the script Nextflow itself hands to the scheduler) and submitting it to your cluster with the relevant submission command.
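
The exact contents of the working directory depend on how far the task got, but for a job that failed at submission you should at least find the .command.run and .command.sh scripts that Nextflow generated. As a rough sketch (the hash is just a placeholder and the listing will vary):

$ ls -a /<path>/<to>/work/e5/6cc8991c2b16c11a6356028228377e
.command.run  .command.sh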

For example, let’s say you get an error like this on a SLURM cluster.

Caused by:
  Failed to submit process to grid scheduler for execution
Command executed:
Command exit status:
Command output:
Work dir:

This does not tell you why the job failed to submit. It is often due to an ‘invalid’ resource request that the scheduler blocks, but unfortunately Nextflow does not pick up the message reported by the cluster.
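
For illustration only (the process name and numbers here are hypothetical), a configuration entry like the following asks for more memory than any node on the imagined cluster can provide, so the scheduler rejects it at submission time while Nextflow only reports the generic failure above:

process {
    withName: BIG_TASK {
        memory = 500.GB    // hypothetical request, larger than any node offers
        time   = 48.h
    }
}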

Therefore, in this case I would switch to the working directory and submit the .command.run file using SLURM’s sbatch command (for submitting batch scripts).

$ cd  /<path>/<to>/work/e5/6cc8991c2b16c11a6356028228377e
$ sbatch .command.run
sbatch: error: job memory limit for shared nodes exceeded. Must be <= 120000 MB
sbatch: error: Batch job submission failed: Invalid feature specification

In this case, SLURM has printed to my console the reason why the job failed to be submitted.

With this information, I can go back to my configuration file, tweak the settings accordingly, and run the pipeline again.
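
As a sketch of such a tweak (the process selector is hypothetical; only the 120000 MB ceiling comes from the sbatch message above), I would cap the memory request just below the limit the scheduler reported:

process {
    withName: BIG_TASK {
        memory = 115.GB    // stays below the 120000 MB limit reported by sbatch
    }
}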