Based on #21 we decided the best course of action was to improve our genotype ID algorithm to fill in these missing genes. The most reliable method was to query dbSNP to get Canonical SPDI representations and use these to determine reference vs. alternate alleles.
Full algorithm:
Use PGKB location to try to locate reference allele among annotated alleles
If not successful, query dbSNP to get SPDI representation
Normalise using this algorithm or existing implementation
Use this to determine which are the alternate alleles
Based on #21 we decided the best course of action was to improve our genotype ID algorithm to fill in these missing genes. The most reliable method was to query dbSNP to get Canonical SPDI representations and use these to determine reference vs. alternate alleles.
Full algorithm: