Change how aa_mm_biopython_runmatch_genie_9loc.py is run so that DR amino acid mismatches are computed differently.
This requires adding a new function to aa_matching_msf_genie.py.
The existing function count_AA_Mismatches_Allele() takes as input a pair of HLA alleles from the donor and a pair of alleles from the recipient for a single locus (e.g. HLA-A or HLA-DQB1), plus an amino acid position, and counts how many unique AA residues at that position are in the donor but not the recipient. If the donor is homozygous at the locus, a mismatched residue is counted once, so the mismatch count is 1 rather than 2.
Make a new version of count_AA_Mismatches_Allele() called count_AA_Mismatches_DR() that takes as input 4 donor alleles and 4 recipient alleles.
It returns 0-4 mismatches per DR position, treating DRB1, DRB3, DRB4 and DRB5 as the same gene with up to 4 copies. For example, a donor could have DRB1*15:01, DRB1*03:01, DRB5*01:01 and DRB3*12:01, and you would count how many unique AA residues across those alleles are in the donor but not the recipient.
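The counting rule for count_AA_Mismatches_DR() could be sketched roughly as below. This is only an illustration under stated assumptions, not the code in aa_matching_msf_genie.py: the get_residue argument, the toy_get_residue helper, the TOY_RESIDUES table, the recipient allele DRB1*04:01 and the position number are all hypothetical, and the residues are placeholders rather than real sequence data. Absent DRB3/4/5 copies are assumed to be passed as None.

```python
def count_AA_Mismatches_DR(donor_alleles, recip_alleles, position, get_residue):
    """Count unique donor AA residues at `position` that no recipient allele has.

    donor_alleles, recip_alleles: up to 4 allele names each (2 x DRB1 plus up
    to 2 x DRB3/4/5); None or "" marks an absent copy (assumption).
    get_residue: hypothetical callable (allele, position) -> one-letter residue.
    """
    # Sets collapse duplicates, so a residue shared by several donor alleles
    # (e.g. a homozygous donor) is only counted once.
    donor_residues = {get_residue(a, position) for a in donor_alleles if a}
    recip_residues = {get_residue(a, position) for a in recip_alleles if a}
    # Residues present in the donor but not in the recipient: 0 to 4.
    return len(donor_residues - recip_residues)


# Placeholder residues at a made-up position 1 (NOT real sequence data).
TOY_RESIDUES = {
    ("DRB1*15:01", 1): "A",
    ("DRB1*03:01", 1): "S",
    ("DRB5*01:01", 1): "A",
    ("DRB3*12:01", 1): "T",
    ("DRB1*04:01", 1): "T",
}


def toy_get_residue(allele, position):
    """Hypothetical lookup standing in for the real alignment lookup."""
    return TOY_RESIDUES[(allele, position)]


donor = ["DRB1*15:01", "DRB1*03:01", "DRB5*01:01", "DRB3*12:01"]
recip = ["DRB1*15:01", "DRB1*04:01", None, None]  # no DRB3/4/5 copies

# Donor residues {A, S, T}, recipient residues {A, T}: only S is mismatched.
mismatches = count_AA_Mismatches_DR(donor, recip, 1, toy_get_residue)
print(mismatches)  # 1
```

Using sets keeps the homozygous behaviour of the single-locus function: identical residues collapse to one entry, so they cannot be double counted.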
Need different Python environments for different steps:
aa_mm_biopython_runmatch.py requires pyARD, which requires an older version of Pandas
construct_outcomes_vars.py requires Pandas 2.0
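One possible way to catch running a step in the wrong environment is a small version guard at the top of the script. The sketch below is an assumption, not something the scripts are known to contain: major_version and pandas_is_at_least_2 are hypothetical helper names, and the check only looks at the major version number.

```python
from importlib import metadata


def major_version(version_string):
    """Return the leading integer of a version string such as '2.0.3'."""
    return int(version_string.split(".")[0])


def pandas_is_at_least_2(installed=None):
    """True if the given (or installed) Pandas version has major version >= 2."""
    if installed is None:
        # Looks up the Pandas version in the active environment;
        # raises PackageNotFoundError if Pandas is not installed.
        installed = metadata.version("pandas")
    return major_version(installed) >= 2
```

construct_outcomes_vars.py could call pandas_is_at_least_2() on startup and exit with a clear message if it returns False, while the pyARD step could check for the opposite.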
DRB1 and DRB3/4/5 are different HLA loci, but the proteins are very similar. DRB3/4/5 shows copy number variation: an individual has at most 2 copies of a DRB3/4/5 gene.