ComparativeGenomicsToolkit / cactus

Official home of genome aligner based upon notion of Cactus graphs
Other
481 stars 106 forks source link

Toilify vcfwave #1404

Closed glennhickey closed 2 weeks ago

glennhickey commented 2 weeks ago

Previously vcfwave was run using a bash script I had kicking around. This made integration into Cactus pretty quick, but wasn't ideal for debugging, since Toil was calling a bash script that in turn was running a bunch of piped commands through parallel. It also didn't support distributed computing.

So this PR re-implements the whole process directly in Toil, which should resolve the above two issues. I'll test it on some large inputs on the cluster before merging (I think the memory resource specs will need some tuning).