ComparativeGenomicsToolkit / cactus

Official home of genome aligner based upon notion of Cactus graphs

use exact disk estimate for merge_alignments job #1497

Closed glennhickey closed 1 month ago

glennhickey commented 1 month ago

Someone ran into a case where the merge_alignments() job was asking for 400Gb of disk, which was more than their cluster could provide.

It looks like this job is using a rough estimate of size(chained(alignments)) ~= 5 * size(alignments), which seems like it could be overly conservative. This PR changes merge_alignments to compute its own disk requirement exactly, like some other jobs already do.
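
To illustrate the difference between the two approaches, here is a minimal sketch in plain Python. It is not the code from this PR: `heuristic_disk`, `measured_disk`, and `overhead_bytes` are hypothetical names, and it assumes that an "exact" requirement can be obtained by summing the real sizes of the files the job will hold on disk.

```python
import os

# Multiplier taken from the rough estimate described above.
HEURISTIC_FACTOR = 5


def heuristic_disk(alignment_sizes):
    """Rough estimate: a fixed multiple of the total input alignment size."""
    return HEURISTIC_FACTOR * sum(alignment_sizes)


def measured_disk(paths, overhead_bytes=0):
    """Sum the actual on-disk sizes of the given files.

    `overhead_bytes` is a hypothetical allowance for scratch space;
    it defaults to zero so the result reflects only measured sizes.
    """
    return sum(os.path.getsize(p) for p in paths) + overhead_bytes
```

With the heuristic, inputs totalling 80 GB lead to a 400 GB request regardless of how large the chained output really is, whereas the measured version only asks for what the listed files occupy.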