PharmGKB / PharmCAT

The Pharmacogenomic Clinical Annotation Tool
Mozilla Public License 2.0
120 stars 39 forks source link

Erroneous Ref allele in the #35

Closed BinglanLi closed 3 years ago

BinglanLi commented 3 years ago
whaleyr commented 3 years ago

@BinglanLi Is this only happening for this deletion in DPYD or does it also happen for other deletions like 3531_3540delAGAAATGGAA in CYP2C9?

BinglanLi commented 3 years ago

I didn't see any issue with 3531_3540delAGAAATGGAA in CYP2C9 in the VCF file. There was no tandem repeat or similar sequence via a visual inspection of 100bp flanking region of that indel on UCSC.

rs72549309 is the only one:

However, just as future reference, I indeed found several additional incorrect indels entries in the VCF. But it's not worth reporting them in details as they are not allele defining positions after switching to CPIC. These include:

whaleyr commented 3 years ago

Pushed another change for the ExtractPositions class. I re-checked all the RSIDs mentioned in this issue and none of them are in the allele definitions anymore so they are no longer showing in the extracted pharmcat.vcf file. I'm going to close this issue since it's a moot point but if there's problems with positions or alleles in the data generated by the new ExtractPositions class please make a new issue.