Closed evanrrees closed 5 years ago
Thanks for the report. I saw the sys
time increased very large, not sure whether it was caused by system or new code. I also found v2.3 run slower than v2.1 on huge dataset, but not too much. Anyway, I will test this problem.
Best, Jue
Thanks for looking into this. These were run on two different servers with the following specs, though I believe the only difference is greater RAM in the second run. v2.1: four 14-core (28-hyperthreaded) Intel Xeon E7 4850 v3 14C 2.1-2.8GHz, 12TB SATA HD (RAID10), 1TB SSD HD, 512GB RAM v2.3: four 14-core (28-hyperthreaded) Intel Xeon E7 4850 v3 14C 2.1-2.8GHz, 12TB SATA HD (RAID10), 1TB SSD HD, 1024GB RAM
Recently, I got a similar case. It was caused by every read have many alignments, processing the alignments block other threads. Now, I fix it at https://github.com/ruanjue/wtdbg2/tree/gcov , please have a try with it.
Now, branch gcov is merged into master, please checkout https://github.com/ruanjue/wtdbg2/commit/8908a3162362df842e540658a45222f764e629a7. It should answer your question.
Hi,
I have noticed substantially slower runtimes with wtdbg2.3 compared with 2.1 on the same dataset. I'm using 100 Gbp of ONT reads on a ~3 Gbp heterozygous plant genome (diploid size 6 Gbp).
With v2.1:
and v2.3:
So with v2.1, wtdbg2 and wtpoa-cns take ~7 hrs and ~4 hrs respectively, while in v2.3 they take ~31 hrs and ~13 hrs. One major difference I can see is that the alignments and ctg.lay outputs are compressed - could this be contributing to the slowdown?
Thanks!