apriltuesday closed 9 months ago
Couldn't we use the reference_genome term from SO? I assume getting the context gene would be fairly easy as well.
I was going to ask OT what they prefer, but yes, we could get the gene and return an SO term (another possibility is no_sequence_alteration).
Closes #18.
Better expected output diff here.
Note that in this implementation, ref/ref genotypes have no consequence or gene annotated; these will be annotated in other genotypes associated with the same variant though. For example:
21_36070377_G_A,A
21_36070377_G_A,A
21_36070377_G_A,G
21_36070377_G_A,G
21_36070377_G_G,G
We might need a follow-up issue to modify this behaviour.
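As a rough sketch of what that follow-up could look like, the snippet below flags ref/ref genotypes and gives them a placeholder SO term. It is not code from this PR. It assumes the IDs above follow a chrom_pos_ref_allele1,allele2 layout (my reading of the example), and the names `Genotype`, `parse_genotype_id`, `is_ref_ref` and `fallback_consequence` are hypothetical. Whether to return no_sequence_alteration or reference_genome is still open until OT confirms what they prefer.

```python
from typing import NamedTuple, Optional, Tuple


class Genotype(NamedTuple):
    """Parsed genotype ID (hypothetical helper type, not part of the PR)."""
    chrom: str
    pos: int
    ref: str
    alleles: Tuple[str, ...]


def parse_genotype_id(genotype_id: str) -> Genotype:
    # Assumed layout: chrom_pos_ref_allele1,allele2 (e.g. 21_36070377_G_A,G).
    # rsplit keeps any underscores in the chromosome/contig name intact.
    chrom, pos, ref, alleles = genotype_id.rsplit("_", 3)
    return Genotype(chrom, int(pos), ref, tuple(alleles.split(",")))


def is_ref_ref(genotype: Genotype) -> bool:
    # A genotype is ref/ref when every called allele equals the reference.
    return all(allele == genotype.ref for allele in genotype.alleles)


def fallback_consequence(genotype_id: str) -> Optional[str]:
    # Returns a placeholder SO term for ref/ref genotypes, and None for
    # everything else so the existing annotation is left untouched.
    if is_ref_ref(parse_genotype_id(genotype_id)):
        return "no_sequence_alteration"
    return None


for gid in ("21_36070377_G_A,A", "21_36070377_G_A,G", "21_36070377_G_G,G"):
    print(gid, fallback_consequence(gid))
```

The gene for a ref/ref genotype could then be copied from the other genotypes at the same variant, since those are already annotated.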
I've also added counts for multi-allelic variants, as requested by OT. I'll post the numbers once I run the entire dataset, but here's what the report looks like for the test set: