Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hotfix for v0.3.1 continued #98

Merged
merged 8 commits into from
Jan 31, 2025
Merged

Hotfix for v0.3.1 continued #98

merged 8 commits into from
Jan 31, 2025

Conversation

abhi18av
Copy link
Member

@abhi18av abhi18av commented Dec 11, 2024

Continuing the discussion from #96 (comment)

@abhi18av abhi18av requested a review from jagedn December 11, 2024 18:31
@abhi18av abhi18av self-assigned this Dec 11, 2024
Copy link
Member Author

@abhi18av abhi18av left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Added some comments for @jagedn

validation/sun-nomadlab/chaos_job.yml Show resolved Hide resolved
Comment on lines 112 to 114
final taskCfg = task.getConfig()
final taskCores = !taskCfg.get("cpus") ? DEFAULT_CPUS : taskCfg.get("cpus") as Integer
final taskMemory = taskCfg.get("memory") ? new MemoryUnit( taskCfg.get("memory") as String ) : new MemoryUnit(DEFAULT_MEMORY)
final taskMemory = !taskCfg.get("memory") ? new MemoryUnit(DEFAULT_MEMORY) : new MemoryUnit( taskCfg.get("memory") as String )
Copy link
Member Author

@abhi18av abhi18av Jan 15, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@jagedn , I think that the CPU/memory allocation needs to be updated because of a resourceLimits directive which was introduced in Nextflow v24.04.x onwards.

Here's a good overview https://nf-co.re/docs/usage/getting_started/configuration#max-resources of the older --max_cpus and --max_memory parameters and the new native directive.

Basically, as of now, if you try to run nf-core/demo pipeline on sun-nomadlab the FASTQC jobs are not being allocated anywhere since the actual resources in the generated Nomad job definition are different from the global override we provide in nextflow.config file.

  • What is set in nextflow.config

process {
    executor = "nomad"

    cpus=4
    memory= "4.GB"
    time= "1.h"

    //NOTE: Basically using the new directive here, but it is a generalized version of above settings.
    resourceLimits = [ cpus: 4, memory: 4.GB, time: 1.h ]

}



  • What Nomad receives
image

NOTE: The problem seems to be that our config is not overriding the default memory for FASTQC (as assigned via a process_medium label in the pipeline config)

I think this might be related to nextflow-io/nextflow#5306 - what are your thoughts?

In the meantime, I think I'll use a fork to control that behavior 🤔

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

surely
I remember I spent lot of days trying to improve the config process but there are lot of posibilities due profiles...

Signed-off-by: Jorge Aguilera <[email protected]>
@abhi18av abhi18av marked this pull request as ready for review January 31, 2025 10:36
@abhi18av abhi18av merged commit c5b14fa into master Jan 31, 2025
1 check failed
@abhi18av abhi18av deleted the hotfix/fix-0.3.1 branch January 31, 2025 10:40
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants