Set BWA and bwamem2 index memory dynamically #6628

Open · wants to merge 3 commits into master
Conversation

@edmundmiller (Contributor, Author) commented Sep 11, 2024

Kept having bwamem2 index tasks that ran forever and then failed.
Updated bwamem2 index to request 28 bytes (28.B) of memory per byte of fasta. Issue for reference: bwa-mem2/bwa-mem2#9

Also tracked down the required memory for bwa index while I was at it. It doesn't seem to fail in practice, because most genomes' requirements fall under the memory the process is already allocated.

Not the first place people have run into this: #6628
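
A minimal sketch of what the dynamic directive could look like at the module level (hypothetical and simplified; not the exact diff from this PR):

process BWAMEM2_INDEX {
    // Hypothetical sketch: request ~28 bytes of RAM per byte of the
    // reference fasta, per bwa-mem2/bwa-mem2#9.
    memory { 28.B * fasta.size() }

    input:
    tuple val(meta), path(fasta)

    output:
    tuple val(meta), path("bwamem2"), emit: index

    script:
    """
    mkdir bwamem2
    bwa-mem2 index -p bwamem2/${fasta.baseName} $fasta
    """
}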

@matthdsm (Contributor) commented
I like it 👍 it’ll also play nice with the new resourceLimits directive

@ewels (Member) commented Sep 11, 2024

I was talking to @drpatelh about this earlier this week. Sounds good. Very neat if it scales in such a linear way.

Should we add a baseline of additional memory?
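
For illustration, a baseline could be a constant term in the same directive (a sketch with hypothetical values, not taken from the PR):

// Hypothetical: a fixed 1.GB baseline plus the linear per-byte term.
memory { 1.GB + 28.B * fasta.size() }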

@edmundmiller (Contributor, Author) commented
Closes nf-core/sarek#1377

@maxulysse (Member) commented
What can we do for bwa mem and bwamem2 mem?

@edmundmiller (Contributor, Author) commented

> What can we do for bwa mem and bwamem2 mem?

What do you mean?

@muffato (Member) commented Oct 16, 2024

Is it right to have these settings hardcoded in the module? How does it interact with a pipeline-level config file doing

withName: 'BWAMEM2_INDEX' {
    memory = { ... }
}

Which one takes precedence?

@matthdsm (Contributor) commented
AFAIK the pipeline config takes precedence over the directives hardcoded in the module.
If you're worried about requesting too many resources, the resourceLimits directive should take care of that nicely.
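
A sketch of that capping behaviour in a pipeline or institutional config (placeholder limits; resourceLimits is available since Nextflow 24.04):

process {
    // Requests above these limits are automatically reduced to fit,
    // rather than failing at submission time.
    resourceLimits = [ cpus: 16, memory: 128.GB, time: 48.h ]
}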

@muffato (Member) commented Oct 16, 2024

I was more worried that 28 GB/Gbp is still too high. I use 24 GB/Gbp in my pipelines and wouldn't want nf-core to force me to waste RAM ;)
Also, your memory definition doesn't consider task.attempt. Are you absolutely certain that 28 GB/Gbp will work for every genome? nf-core resource definitions usually factor in task.attempt.

I wasn't worried about the missing check_max, since nf-core is about to mandate a recent Nextflow version that supports resourceLimits.
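
A sketch of folding task.attempt into the scaling (hypothetical factor and retry settings):

// Hypothetical: grow the request proportionally on each retry after an
// out-of-memory failure.
memory { 24.B * fasta.size() * task.attempt }
errorStrategy 'retry'
maxRetries 2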

@muffato (Member) commented Oct 16, 2024

FYI, I've just checked our LSF logs and there have been zero memory failures over the 1,698 BWAMEM2_INDEX processes that we ran in 2024 with 24 GB/Gbp.
The median memory efficiency is ~76% and goes up to 95%, meaning that 23 GB/Gbp might still work for all genomes (it's right at the limit), but 22 GB/Gbp would certainly yield some memory errors.

Regardless of the scaling factor you use, I'd still keep task.attempt just in case (I'm overcautious!).
