hail-is / hail

Cloud-native genomic dataframes and batch computing
https://hail.is
MIT License
974 stars 243 forks source link

[vds] `vds.split_multi` needs to fix up LGT/GT #14694

Closed chrisvittal closed 1 week ago

chrisvittal commented 1 week ago

I can add more if necessary, I intend to fix this soon and don't want to forget. While adding GT to reference data, we try to set it such that the fields for the genotype are consistent across reference and variant data. split_multi currently only splits the variant data, but splitting drops LGT so if LGT is present in reference data, then we should make it GT.