wustl-oncology / cloud-workflows

Infrastructure and tooling required to get genomic workflows running in the cloud
1 stars 6 forks source link

Prototype reference disks for reference directories like VEP cache #14

Open johnmaruska opened 2 years ago

johnmaruska commented 2 years ago

Some tools use very large directories as a static input. Cromwell offers support for what they call Reference Disks. We could leverage these to quickly shuffle around the huge directories we need without constantly localizing them.

Mostly interested in this increasing speed, but it could potentially decrease cost either directly through the speed increase or getting to use cheaper disks for that