EVA Configuration

All nf-core pipelines have been successfully configured for use on the Department of Genetics and Archaeogenetic’s clusters at the Max Planck Institute for Evolutionary Anthropology (MPI-EVA).

To use, run the pipeline with -profile eva. You can further with optimise submissions by specifying which cluster queue you are using e,g, -profile eva,archgen. This will download and launch the eva.config which has been pre-configured with a setup suitable for the all.q queue. The number of parallel jobs that run is currently limited to 8.

Using this profile, a docker image containing all of the required software will be downloaded, and converted to a singularity image before execution of the pipeline.

Institute-specific pipeline profiles exists for:

  • eager
  • mag
  • sarek
  • taxprofiler

Additional Profiles

We currently also offer profiles for the different department’s specific nodes.

archgen

If you specify -profile eva,archgen you will be able to use the nodes available on the archgen.q queue.

Note the following characteristics of this profile:

  • By default, job resources are assigned a maximum number of CPUs of 32, 256 GB maximum memory and 365 day maximum wall time.
  • Using this profile will currently store singularity images in a cache under /mnt/archgen/users/singularity_scratch/cache/. All archgen users currently have read/write access to this directory, however this will likely change to a read-only directory in the future that will be managed by the IT team.
  • Intermediate files will be automatically cleaned up (see debug below if you don’t want this to happen) on successful run completion.
  • Jobs submitted with >700.GB will automatically be submitted to the dynamic bigmem.q.

NB: You will need an account and VPN access to use the cluster at MPI-EVA in order to run the pipeline. If in doubt contact the IT team. NB: Nextflow will need to submit the jobs via SGE to the clusters and as such the commands above will have to be executed on one of the head nodes. If in doubt contact IT.

debug

This simple profile just turns off automatic clean up of intermediate files. This can be useful for debugging. Specify e.g. with -profile eva,archgen.

Config file

See config file on GitHub

eva.config
//Profile config names for nf-core/configs
params {
    config_profile_description = 'Generic MPI-EVA cluster(s) profile provided by nf-core/configs.'
    config_profile_contact = 'James Fellows Yates (@jfy133)'
    config_profile_url = 'https://eva.mpg.de'
}
 
// Preform work directory cleanup after a successful run
cleanup = true
 
singularity {
    enabled = true
    autoMounts = true
}
 
process {
    executor = 'sge'
    penv = 'smp'
    queue = 'all.q'
    clusterOptions = { "-S /bin/bash -V -j y -o output.sge -l h_vmem=${task.memory.toGiga()}G" }
}
 
profiles {
    archgen {
        params {
            igenomes_base = "/mnt/archgen/public_data/igenomes"
            config_profile_description = 'MPI-EVA archgen profile, provided by nf-core/configs.'
            max_memory = 256.GB
            max_cpus = 32
            max_time = 365.d
            //Illumina iGenomes reference file path
        }
 
        process {
            queue = { task.memory > 700.GB ? 'bigmem.q' : 'archgen.q' }
            clusterOptions = { "-S /bin/bash -V -j y -o output.sge -l h_vmem=${task.memory.toGiga()}G" }
        }
 
        singularity {
            cacheDir = "/mnt/archgen/tools/singularity/containers/"
        }
 
    }
    // Profile to deactivate automatic cleanup of work directory after a successful run. Overwrites cleanup option.
    debug {
        cleanup = false
    }
}