sanger-tol / treeval

Pipelines for the production of Treeval data
https://pipelines.tol.sanger.ac.uk/treeval
Other
22 stars 4 forks source link

mummer optimisation #199

Closed yumisims closed 8 months ago

yumisims commented 9 months ago

mummer has limitations (size) on both reference and query sequences.

(maximum reference length is 2305843009213693948) (maximum query length is 18446744073709551615)

although the above leads to no error warning on input but would lead to huge memory consumption on the mummer step. (Nummer requires approx 30 * reference size RAM to start the job). To avoid this problem, an additional channel to group reference sequence at max 1G per chunk to result in more efficient performance.