This PR includes various bug fixes for running RMC pipeline on freeze 7 including:
Fixing use of prepare_ht_for_constraint_calculations, adding an option to switch to adding annotations in process_context_ht and in process_vep. This was to avoid a bug later which was trying to read a non-existent column in HT.
Move another annotation function into process_vep.
Fix a typo in create_filtered_context.
Add a toggle for building plateau models or use the existing models if available.
Fix model naming to avoid inconsistencies.
Fix a bug in resource path to plateau model.
Update Hail in docker file.
Change number of partitions for performance improvement.
Adding additional checkpoint in process_sections and fix some logic.
Remove gnomad and gnomad_qc from docker and fix python versioning in Dockerfile
Update search_for_two_breaks function to use chisq annotation throughout instead of max_chisq.
Update default docker image path to reflect newer image.
remove wait=False from b.run() in run_batches.py.
Temporary workaround changes due to a hail bug and updating docstring for future to-do in run_batches.py.
This PR includes various bug fixes for running RMC pipeline on freeze 7 including:
prepare_ht_for_constraint_calculations
, adding an option to switch to adding annotations inprocess_context_ht
and inprocess_vep
. This was to avoid a bug later which was trying to read a non-existent column in HT.process_vep
.create_filtered_context
.process_sections
and fix some logic.gnomad
andgnomad_qc
from docker and fix python versioning in Dockerfilesearch_for_two_breaks
function to usechisq
annotation throughout instead ofmax_chisq
.wait=False
fromb.run()
inrun_batches.py
.run_batches.py
.