Removes translated PGKB genes from the main evidence string (possibly temporarily). There are two reasons for this: aligning them with our VEP-provided genes is complicated when a record has multiple genes that are being exploded, and information received directly from PGKB indicates that these genes are associated with the variant.
Fixes issues that arose from the test run and adds tests.
Counts from test run (note that VEP gene and consequence annotation counts are reduced by known issues in how we get variant coordinates to query VEP):
Total clinical annotations: 5073
With RS: 4477
Exploded by allele: 13497
Exploded by drug: 18830
Exploded by phenotype: 23086
Total evidence strings: 25475
With CHEBI: 21205
With EFO phenotype: 6830
With functional consequence: 22891
With VEP gene: 22891
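For readers unfamiliar with the "exploded by" counts above, here is a minimal sketch of how successive explode steps grow the row count. It is illustrative only and not the pipeline's code: the `explode` helper, the field names (`alleles`, `drugs`, `phenotypes`) and the toy records are all hypothetical, and it assumes a record with no values for a field is kept as a single row rather than dropped.

```python
def explode(records, key):
    """Return one copy of each record per value in record[key]."""
    exploded = []
    for record in records:
        # Assumption: a record with no values for this key is kept as a
        # single row with None instead of being dropped.
        values = record.get(key) or [None]
        for value in values:
            row = dict(record)
            row[key] = value
            exploded.append(row)
    return exploded


# Toy annotations with hypothetical field names and values.
annotations = [
    {
        "id": "ann1",
        "alleles": ["A", "G"],
        "drugs": ["warfarin", "aspirin"],
        "phenotypes": ["bleeding", "thrombosis"],
    },
    {"id": "ann2", "alleles": ["T"], "drugs": ["codeine"], "phenotypes": []},
]

by_allele = explode(annotations, "alleles")
by_drug = explode(by_allele, "drugs")
by_phenotype = explode(by_drug, "phenotypes")

print(len(by_allele), len(by_drug), len(by_phenotype))
```

Under that assumption each step can only keep or increase the number of rows, which matches the direction of the counts reported above.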
Gene comparisons per annotation:
With PGKB genes: 4220
With VEP genes: 4059
PGKB genes != VEP genes: 811
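
One way the per-annotation gene comparison could be computed is sketched below. This is not taken from the pipeline: the `compare_gene_sets` function, the `pgkb_genes` and `vep_genes` field names and the toy data are hypothetical, and it assumes two gene sets count as different whenever they are unequal, including when only one of them is empty.

```python
def compare_gene_sets(annotations):
    """Count annotations with PGKB genes, with VEP genes, and with unequal sets."""
    counts = {"with_pgkb": 0, "with_vep": 0, "differ": 0}
    for annotation in annotations:
        pgkb = set(annotation.get("pgkb_genes", []))
        vep = set(annotation.get("vep_genes", []))
        if pgkb:
            counts["with_pgkb"] += 1
        if vep:
            counts["with_vep"] += 1
        # Assumption: any inequality counts, including one side being empty.
        if pgkb != vep:
            counts["differ"] += 1
    return counts


# Toy annotations; the gene symbols are only placeholders for illustration.
toy_annotations = [
    {"id": "a1", "pgkb_genes": ["CYP2D6"], "vep_genes": ["CYP2D6"]},
    {"id": "a2", "pgkb_genes": ["CYP2C19", "CYP2C9"], "vep_genes": ["CYP2C19"]},
    {"id": "a3", "pgkb_genes": ["VKORC1"], "vep_genes": []},
    {"id": "a4", "pgkb_genes": [], "vep_genes": []},
]

gene_counts = compare_gene_sets(toy_annotations)
print(gene_counts)
```

Comparing as sets ignores gene order and duplicates, so an annotation is only flagged when the two sources actually disagree on which genes are involved.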