Closed MattWellie closed 4 months ago
Driver job: https://batch.hail.populationgenomics.org.au/batches/461265/jobs/1 Worker Batch: https://batch.hail.populationgenomics.org.au/batches/461266
This run uses a different approach:
This test run was successful, but it was successful with a teeny weeny baby MT (~2MB total). Just a proof of concept, but a success.
Closing as superseded by #829
Closes #800
Tested here with a small Exome MT -> ES (2GB): https://batch.hail.populationgenomics.org.au/batches/461296
~Untested~
Process:
seqr-loading-pipelines
This is complete theft:
some optimisation params: https://github.com/broadinstitute/seqr-loading-pipelines/blob/c113106204165e22b7a8c629054e94533615e7d2/hail_scripts/elasticsearch/hail_elasticsearch_client.py#L196-L206 https://www.elastic.co/guide/en/elasticsearch/hadoop/current/configuration.html