broadinstitute / gatk-sv

A structural variation pipeline for short-read sequencing
BSD 3-Clause "New" or "Revised" License

Optimize Scramble #603

Closed mwalker174 closed 1 year ago

mwalker174 commented 1 year ago

This PR incorporates several updates to the Scramble tool and workflow that substantially improve performance and reduce costs on GCP.

Final cost figures are forthcoming, but the current estimate is <$0.10 per sample. We should also see fewer failures and fewer long-running outliers going forward.

Identical output was confirmed on the 1KGP reference panel (on primary contigs).
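
For readers who want to run a similar check, here is a minimal sketch of one way to compare two sets of calls on primary contigs. It is not necessarily how the comparison for this PR was done. It assumes the outputs are VCF-formatted text, that "primary contigs" means chr1-chr22, chrX and chrY, and that records are considered identical only when every tab-separated column matches; the function names are hypothetical.

```python
import gzip

# Assumption: "primary contigs" means chr1-chr22, chrX and chrY.
PRIMARY_CONTIGS = {f"chr{c}" for c in [*range(1, 23), "X", "Y"]}


def open_text(path):
    """Open a plain or gzip-compressed text file for reading."""
    return gzip.open(path, "rt") if str(path).endswith(".gz") else open(path)


def primary_records(lines):
    """Return sorted VCF data records on primary contigs, skipping headers."""
    records = []
    for line in lines:
        # Header lines start with '#'; blank lines carry no record.
        if line.startswith("#") or not line.strip():
            continue
        fields = tuple(line.rstrip("\n").split("\t"))
        if fields[0] in PRIMARY_CONTIGS:
            records.append(fields)
    # Sorting makes the comparison independent of record order.
    return sorted(records)


def same_calls(old_lines, new_lines):
    """True if both inputs hold identical records on primary contigs."""
    return primary_records(old_lines) == primary_records(new_lines)
```

To compare two files, open each with `open_text` and pass the file objects to `same_calls`. Header lines are ignored, so differences in tool version or command-line metadata do not count as mismatches.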

VJalili commented 1 year ago

Thank you, @mwalker174! This looks very good, and I'm excited to see we're getting closer to a better Scramble integration that can replace the restrictive alternative.

Out of curiosity, is there a reason the Scramble Docker image was not added to build_docker.py?