
Adding the MRC LMS Jex cluster #561

Merged · 1 commit merged into nf-core:master on Nov 10, 2023

Conversation

@A-N-Other (Contributor) commented Oct 2, 2023


name: MRC LMS Jex cluster config
about: Adding the MRC LMS Jex cluster

Please follow these steps before submitting your PR:

  • If your PR is a work in progress, include [WIP] in its title
  • Your PR targets the master branch
  • You've included links to relevant issues, if any
  • Requested review from @nf-core/maintainers and/or #request-review on Slack

Steps for adding a new config profile:

  • Add your custom config file to the conf/ directory
  • Add your documentation file to the docs/ directory
  • Add your custom profile to the nfcore_custom.config file in the top-level directory (see the sketch after this list)
  • Add your custom profile to the README.md file in the top-level directory
  • Add your profile name to the profile: scope in .github/workflows/main.yml
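
For reference, the entry added to nfcore_custom.config normally just includes the cluster's config file from this repository. A minimal sketch of what the Jex entry might look like (illustrative only; the actual contents are in this PR's diff):

// Sketch of an nfcore_custom.config profile entry: registers `-profile jex`
// so that it pulls in conf/jex.config from the shared configs repository.
profiles {
  jex { includeConfig "${params.custom_config_base}/conf/jex.config" }
}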

@maxulysse (Member) left a comment


LGTM, let's check if all tests are successful

conf/jex.config (review thread resolved)
@pontus (Contributor) commented Oct 3, 2023

Looks good (although it looks a bit odd to me with a 4 Tbyte node with only 16 cores).

@A-N-Other (Contributor, Author) commented Oct 3, 2023

> Looks good (although it looks a bit odd to me with a 4 Tbyte node with only 16 cores).

@pontus hmem actually goes up to 64 cores, but I don't want jobs being pushed into the hmem queue purely because of cpu requests (rather than RAM, obviously) when people use { check_max( x * task.attempt, 'cpus' ) } closures for job resubmissions. I've therefore set max_cpus such that all jobs will fit within the cpu partition. Is there a better way to do this?
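
For context, the check_max helper from the nf-core pipeline template caps every request at the corresponding params.max_* value, so with max_cpus = 16 a resubmission closure can never escalate a job past 16 cores. A simplified sketch of the cpus branch (not the exact template code):

// Simplified sketch: requests above params.max_cpus are capped, so
// `check_max( x * task.attempt, 'cpus' )` can never exceed max_cpus.
def check_max(obj, type) {
  if (type == 'cpus')
    return Math.min(obj as int, params.max_cpus as int)
  return obj  // memory and time are handled analogously in the real template
}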

// EDIT // Would this work as I'd anticipate ... ?

process {
  executor = 'slurm'
  // Route each task to a partition based on its resource request:
  // small, short jobs -> 'nice'; very large memory jobs -> 'hmem'
  // (lifting the caps there); everything else -> 'cpu'.
  queue = {
    if ( task.time <= 6.h && task.cpus <= 8 && task.memory <= 64.GB ) {
      'nice'
    } else if ( task.memory > 256.GB ) {
      // lift the resource caps only for jobs routed to hmem
      params.max_cpus = 64
      params.max_time = 7.d
      'hmem'
    } else {
      'cpu'
    }
  }
  clusterOptions = '--qos qos_batch'
}

@pontus (Contributor) commented Oct 4, 2023

No, there's no great solution for that. I haven't verified it, but I would expect that setting params in such a closure does not work.

One possible (though not great) solution would be to add pipeline-specific configurations that change the number of cpus requested for the problematic processes (something like the sketch below). That said, I would probably also have taken the shortcut of just using 16 here, even though it will waste a lot of cores on the hmem machine (I assume the scheduler is set to allow high-memory, low-core jobs).
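
Something along these lines, i.e. a pipeline-level override that pins the cpu request of an offending process (the process name and values here are purely illustrative):

// Hypothetical pipeline-specific config: keep a large-memory process at 16
// cores so it fits the hmem node without inflating its cpu request.
process {
  withName: 'BIG_MEMORY_PROCESS' {   // illustrative name, not from a real pipeline
    cpus   = 16
    memory = 512.GB
  }
}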

@A-N-Other (Contributor, Author) commented Oct 4, 2023

I hadn't seen an example of dynamic setting of max_* in any of the other configs, so that was what I'd figured.

I'd rather cap cpus overall than have the hmem partition clogged, so I'll leave it as it is. It's fairly rare for any Nextflow process to request 4 TB anyway, so the rest of the node's capacity stays available for other jobs that SLURM can backfill into the space.

@jfy133 merged commit f1d5daa into nf-core:master on Nov 10, 2023
103 checks passed