chhylp123 / hifiasm

Hifiasm: a haplotype-resolved assembler for accurate Hifi reads
MIT License

Job keeps failing due to time limit of 7 days enforced by cluster. #712

Open JohnUrban opened 2 weeks ago

JohnUrban commented 2 weeks ago

Hi,

As always, thank you for your tools, innovation, and help on issues.

I've launched a HiFiasm job twice now. Both times it failed to complete in 7 days.

The first run did finish writing the bin files, so the second run was able to resume from there.

However, from what I can tell, there were no additional files written or any new landmarks for HiFiasm to start from.

So, I have requested a relaunch, but I am not hopeful it will finish. I think it will just be a repeat of the first re-launch.

Some details:

Is there a stepwise set of commands I can run to help break this job up into smaller pieces that can each finish in 7 days or less?

Any guidance would be much appreciated.

Best,

John

JohnUrban commented 1 week ago

As expected, the job did not finish (again).

I am now trying to give it more memory and more threads: up to 80 threads instead of 32.

Hopefully this can finish in 7 days. At the moment, the cluster says it won't even start for another 10 days though.

JohnUrban commented 6 days ago

Alright. So here is more background and my current solution.

Datasets:

Solution:

HiFiasm finished in a day with 100 GB of memory allotted to it. By comparison, the earlier runs were allotted 500-1500 GB of memory and still did not finish within 7 days.

Any further discussion on down-sampling would be appreciated.
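
To make the down-sampling discussion concrete, here is a minimal sketch of random read down-sampling. It is not necessarily what was done for the run above. It assumes plain 4-line FASTQ records (no wrapped sequence lines); the function name `downsample_fastq` and the `keep_fraction` and `seed` parameters are hypothetical, chosen only for illustration.

```python
import random
from typing import Iterable, Iterator, List


def downsample_fastq(
    lines: Iterable[str], keep_fraction: float, seed: int = 0
) -> Iterator[List[str]]:
    """Yield 4-line FASTQ records, keeping each with probability keep_fraction.

    Assumption: every record is exactly 4 lines (header, sequence, '+', qualities).
    """
    if not 0.0 <= keep_fraction <= 1.0:
        raise ValueError("keep_fraction must be between 0 and 1")
    rng = random.Random(seed)  # fixed seed so the same subset can be reproduced
    record: List[str] = []
    for line in lines:
        record.append(line.rstrip("\n"))
        if len(record) == 4:
            # Decide independently for each read whether to keep it.
            if rng.random() < keep_fraction:
                yield record
            record = []


# Usage sketch: keep roughly half of the reads from an in-memory example.
example = ["@r1", "ACGT", "+", "IIII", "@r2", "TTGA", "+", "IIII"]
for rec in downsample_fastq(example, keep_fraction=0.5, seed=1):
    print("\n".join(rec))
```

Sampling each read independently gives only an approximate fraction, and it ignores read length, so the retained coverage is approximate as well. Whether random sampling or a length-aware filter is the better choice for a given dataset is exactly the kind of thing worth discussing.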

And as for how this issue/insight could potentially enhance the user experience for HiFiasm: