One "untip" job does not finish in time. #291
Comments
Well, it failed as expected. For some reason, last week when I was trying I wanted to ask for more CPUs, but thought it was causing the silent failure. Today (after the job failed), I tried So, we will see if adding more CPUs and more memory helps it finish within 7 days.
What is the input data for the assembly?
HiFi, UL-ONT, and Hi-C. 300 GB genome (600 GB diploid). There may be an endosymbiont present with a similar genome size. There is >300X HiFi and ~120X ONT coverage.
I think that with that much HiFi coverage you might be getting many more noise nodes than expected, which overcomplicates the graph (e.g. recurrent errors creating nodes with realistic coverage). I'd suggest either downsampling to roughly 100X HiFi or increasing the
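To make the downsampling suggestion concrete, here is a minimal sketch of the arithmetic, assuming reads are kept or dropped uniformly at random. The function names `downsample_fraction` and `subsample` are hypothetical helpers for illustration only; they are not part of Verkko or any other tool, and a dedicated read-subsampling utility would likely be a better fit for real FASTQ files of this size.

```python
import random
from typing import Iterable, Iterator, TypeVar

T = TypeVar("T")


def downsample_fraction(current_coverage: float, target_coverage: float) -> float:
    """Fraction of reads to keep so expected coverage drops to the target.

    Assumes uniform random sampling of reads, so the read-length
    distribution is preserved in expectation.
    """
    if current_coverage <= 0 or target_coverage <= 0:
        raise ValueError("coverage values must be positive")
    # Never return more than 1.0: coverage cannot be raised by subsampling.
    return min(1.0, target_coverage / current_coverage)


def subsample(records: Iterable[T], fraction: float, seed: int = 0) -> Iterator[T]:
    """Yield each record independently with probability `fraction`."""
    rng = random.Random(seed)  # fixed seed so the subset is reproducible
    for record in records:
        if rng.random() < fraction:
            yield record


if __name__ == "__main__":
    # Coverage figures quoted in this thread: >300X from one lab,
    # 500-600X when both labs' data are combined.
    for coverage in (300, 600):
        print(coverage, downsample_fraction(coverage, 100))
```

With the figures from this thread, that means keeping about one third of the reads from a 300X set, or about one sixth from a combined 600X set.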
Thanks. Yes, this is a slightly weird situation where two experimental labs generated data from the same sample, and both want their data to be included in the assembly. I actually quoted the amount of HiFi from one lab. There is 500-600X from both labs. I will try
Note - at the moment, I cannot say whether this issue is resolved. Another issue arose now where SLURM is not behaving well with Verkko.
Verkko worked fine on the same cluster in early September, and I am guessing something changed with the cluster. That is to say, I don't think it is a Verkko issue, and I have opened an issue with the cluster people to resolve this.
Update on the recent SLURM issue: the SLURM back-end problem is now solved. They made some changes last week, and certain environment variables were not adjusted, causing communication/connection issues (seen above). I will get back to you on
Hi again,
Thanks as always for the tool and help.
There is a single step 5 "untip" job that will not finish. Verkko has tried up to 6 days. I just re-launched Verkko instructing it to try 7 days (168 hours). I also added more memory to the request in a feeble hope of it helping. We will see.
That job will end next Wednesday, but I am not hopeful that it will finish. The problem is that the cluster does not allow requests longer than 7 days. So I am wondering if there is a way to further break this job down into two or more jobs. I tried telling it to request more CPUs, but that seemed to be a non-starter. Verkko complained about Rukki paths and quit.
I can give more details of course, but I guess the "big picture" here is just whether or not there will be a way around the time limit thing.
Many thanks,
John