broadinstitute / genetic-prevalence-estimator

https://genie.broadinstitute.org/
BSD 3-Clause "New" or "Revised" License

Use hailctl to start Dataproc clusters #70

Open nawatts opened 2 years ago

nawatts commented 2 years ago

Because of an issue with the hailctl configuration in Hail's Docker image (see Zulip thread), we currently use gcloud rather than hailctl to start the Dataproc cluster.

https://github.com/broadinstitute/aggregate-frequency-calculator/blob/fa55b0beda132977eba115e80bf907fdecd465a5/terraform/build-data-pipelines.tf#L78-L110

We should be able to work around this issue by re-installing Hail from PyPI in the Docker image:

HAIL_VERSION=$(pip show hail | grep '^Version:' | sed 's/Version: //')
pip install --force-reinstall --no-dependencies "hail==$HAIL_VERSION"
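
For the Docker image build, the reinstall step could be wrapped so that it fails loudly when the installed version cannot be determined, instead of running pip with an empty version pin. This is only a sketch: the function names `parse_version` and `reinstall_hail` are hypothetical and do not come from the repository, and it assumes `pip show` prints a line of the form `Version: X.Y.Z`.

```shell
# Read `pip show` output on stdin and print only the value of the
# "Version:" field. Anchoring on the start of the line avoids matching
# other fields whose names merely contain "Version".
parse_version() {
    sed -n 's/^Version: //p'
}

# Hypothetical helper: reinstall Hail from PyPI at the version that is
# already present in the image, without touching its dependencies.
reinstall_hail() {
    hail_version=$(pip show hail | parse_version)
    if [ -z "$hail_version" ]; then
        echo "could not determine the installed Hail version" >&2
        return 1
    fi
    pip install --force-reinstall --no-dependencies "hail==$hail_version"
}
```

Calling `reinstall_hail` from the image build script would then perform the same two steps as above, with a non-zero exit status if Hail is not installed.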