Duplex basecalling running time #339

Closed
dweemx opened this issue Apr 24, 2023 · 1 comment

Comments

@dweemx

dweemx commented Apr 24, 2023

I tried to run bonito in duplex mode:

bonito \
   duplex \
   --pairs pseudopairs.txt \
   --max-cpus 1 \
   --device cuda:0 \
   dna_r10.3 \
   fast5_dir \
   > duplex.fasta

with 780 pairs, using Bonito v0.4.0. The machine has one NVIDIA Corporation TU104GL [Tesla T4] GPU, 64 CPU cores, and 256 GB RAM. The run had not finished after 3 hours.

[downloaded dna_r10.3.zip]
> loading model
> building read index
CPU: 1% (64 cores) * Memory: 6956/254984MB * Storage: 24/841GB * Net: 12↓/0↑MBps
CPU: 0% (64 cores) * Memory: 6981/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7008/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7008/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7003/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7017/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7010/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7006/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7012/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7021/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7011/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7013/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7030/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7082/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7022/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7028/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7027/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7023/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps
CPU: 0% (64 cores) * Memory: 7015/254984MB * Storage: 24/841GB * Net: 0↓/0↑MBps

Any idea why this is taking so long? Are there any benchmarks available?

@davidnewman02
Collaborator

It looks like you've set --max-cpus 1 here, which may be limiting basecalling performance.
Duplex is significantly more computationally expensive than simplex basecalling, and the joining of the signals is done on the CPU.
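
As a rough sketch of what raising the CPU limit could look like, the snippet below rebuilds the command from the original report with a higher --max-cpus value. The choice of "all cores reported by the OS" is an assumption for illustration, not a value recommended in this thread, and the variable names max_cpus and cmd are hypothetical. The snippet only prints the command so it can be checked before running.

```shell
# Sketch only: same arguments as the original report, with --max-cpus raised.
# Assumption: using every core the OS reports is acceptable on this machine.
max_cpus="$(nproc 2>/dev/null || getconf _NPROCESSORS_ONLN)"

# Assemble the command line; paths and model name are copied from the report.
cmd="bonito duplex --pairs pseudopairs.txt --max-cpus ${max_cpus} --device cuda:0 dna_r10.3 fast5_dir"

# Print the command for inspection instead of executing it.
echo "$cmd"
```

To run it, paste the printed command into the shell and redirect the output as before (> duplex.fasta). Whether this removes the stall seen in the log is untested here; a lower value may be more appropriate on a shared machine.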
