populationgenomics / hail

Scalable genomic data analysis.
https://hail.is
MIT License
1 stars 1 forks source link

Run a scalability test on the Australian Hail Batch instance #2

Closed lgruen closed 3 years ago

lgruen commented 3 years ago

The idea is to verify whether we can scale to substantial dataset sizes. We'll probably run into issues around GCP quotas and other issues.

For example, we could run a burden test on the UK Biobank data (if mirroring the necessary data turns out to be feasible).

lgruen commented 3 years ago

We'll get to this automatically with the upcoming QC tests.