Closed ch-kr closed 4 days ago
adding a comment that I used freeze 1 for preliminary results; we can either plan to keep this data freeze (which means we'd have to update FREEZES
and CURRENT_FREEZE
in rmc.py
) or we can overwrite the results from this freeze when we are ready to rerun
PR contains changes needed to get RMC search pipeline to run on v4/GRCh38.
Major changes:
rmc/utils/generic.py
andrmc/utils/constraint.py
get_aa_from_context
to keep all of the annotations to actually extra AA information;rmc/utils/generic.py
rmc/utils/constraint.py
rmc/utils/constraint.py
rmc/utils/constraint.py
-- NOTE that this code hasn't been tested (updated after I finished running)Minor changes:
rmc/pipeline/regional_constraint.py
rmc/pipeline/regional_constraint.py
,rmc/utils/constraint.py
context_with_oe
creation from RMC results finalizing;rmc/pipeline/regional_constraint.py
rmc/pipeline/regional_constraint.py
transcript_ref
andtranscript_cds
creation (previous code was buggy);rmc/utils/data_loading.py
rmc/resources/reference_data.py
rmc/pipeline/two_breaks/run_batches_dataproc.py
rmc/pipeline/two_breaks/run_batches.py
TEMP_PATH_WITH_FAST_DEL
for temporary files generated that aren't needed;rmc/utils/simultaneous_breaks.py
naive_coalesce
>repartition
to avoid zip length mismatch error;rmc/utils/simultaneous_breaks.py
rmc/utils/mpc.py
due to removal of coverage filtering criteria fromkeep_criteria
rmc/utils/missense_badness.py
constraint_flag
field name toconstraint_flags
and additional filter toENST
transcripts only;rmc/utils/generic.py