alekseyzimin / masurca

GNU General Public License v3.0
239 stars 35 forks source link

masurca 3.2.5 #7

Open ovidp opened 6 years ago

ovidp commented 6 years ago

I am running masurca for a (highly homozygote) plant genome of 1.3 Gb on a cluster with 2 Tb RAM and 90 cores. I have 100x Illumina coverage and ca 12x PacBio. Masurca is already running for 8 days and it predicted 89000 overlap jobs, that are running at the speed of ca 100/hour. I have 2 questions: 1) at this speed one can predict only the overlap jobs will take 36 more days. Is this something that is expected for this configuration and genome size? It is important that I dicuss with the cluster administrators if that is the case. 2) I am running v3.2.5 for 8 days and now I noticed in a previous post you do not recommend that. Shall I stop and revert to 3.2.4? Is there any way I can still use the files already outputed by 3.2.5? Thanks a lot

yyx8671 commented 6 years ago

Hi, I have also been using 3.2.5. Can you please tell why it is not recommended (seems you have moved 3.2.5). Cheers, Andy

alekseyzimin commented 6 years ago

The 3.2.5 did not have any code that had a major bug or was wrong, it was simply inefficient in the implementation of the new features. You can continue using 3.2.5 and your results will be valid. I decided that 3.2.5 was not "better" than 3.2.4 and thus I pulled to streamline and clean up the code for 3.2.6 release.