Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Minialign using large volumes of memory and crashing out due to a segmentation fault #12

Open
Jacob-Ferrier opened this issue Jun 20, 2019 · 1 comment

Comments

@Jacob-Ferrier
Copy link

Hello,

I am attempting to use minialign to align subreads against a reference fasta. I am having two issues:

Issue 1:
I have generated the index using the following command (where REFERENCE points to the reference fasta):

/users/jferrier/software/minialign/minialign -t 64 -d index.mai ${REFERENCE}

The process was given 4TB of memory as well as a walltime of 256 hours.
The process succeed however it used approximately 3.7TB of memory. The reference is approximately 415GB. The resultant index is approximately 438GB. Do you know what is using so much memory?

Issue 2:
After successfully generating the index I have attempted to align some reads against the index using the following command (where READS points to the reads fastq):

/users/jferrier/software/minialign/minialign -t 64 -v 2 index.mai ${READS} > minialign_test_alignment.sam

This process was given 2TB of memory as well as a walltime of 256 hours.
This process failed and all that was written out was the following:

[M::main] Version: 0.6.0-44-g5fd40a5, Build: AVX2
[M::main_align::1288.965*7.19] loaded/built index for 41818118 target sequence(s).
/var/spool/torque/mom_priv/jobs/1183618.cph-m1.uncc.edu.SC: line 44: 136454 Segmentation fault (core dumped) /users/jferrier/software/minialign/minialign -t 64 -v 2 index.mai ${READS} > minialign_test_alignment.sam

The process used approximately 1.16TB of memory before crashing out in approximately 2.5 hours. As I mentioned before the index.mai file is approximately 438GB. The reads file is approximately 795MB.
The process does generate minialign_test_alignment.sam with is approximately 3.0GB however the process does not finish. Do you know why the process is crashing out?

This very well could be user error, please let me know if you have any insight or if I could provide any other information to help diagnose. Thank you very much!

@ocxtal
Copy link
Owner

ocxtal commented Jun 24, 2019

It seems natural that minialign consumes ~4TB memory for 415GB reference (if it's not compressed; ~415Gbp). It depends on k and w, but avarage index footprint is around 11-13 times of the total reference sequence length in bytes. Larger w will help you decrease the footprint.

If it doesn't run when the reference is halved, it might be a bug. Then please let me know the log printed on your terminal, and input sequences if possible.

Thanks,

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants