PharmGKB / PharmCAT

The Pharmacogenomic Clinical Annotation Tool
Mozilla Public License 2.0
120 stars 39 forks source link

confused by rs4124874 and rs12777823 variants #47

Closed mathob closed 3 years ago

mathob commented 3 years ago

bug

pharmcat_positions_0.8.0.fixed.vcf includes two deletion variants whose representation seems contrary to what is spefified for Deletions on the "VCF Requirements" page

java -cp pharmcat-0.8.0-all.jar org.pharmgkb.pharmcat.definition.ExtractPositions -o pgx.vcf cat pgx.vcf | grep -e rs4124874 -e rs12777823 chr2 233757013 rs4124874 T . . PASS PX=UGT1A1:*1[5]isT GT 0/0 chr10 94645745 rs12777823 G . . PASS POI GT 0/0

variants representation with ALT allele not a dot ??

testing PharmCat and wishing to understand messages warning of "missing" variants

mathob commented 3 years ago

forget this please - sorry to bother you

whaleyr commented 3 years ago

No worries, this is actually a good question.

You may have already seen but rs4124874 has recenlty been removed from the UGT1A1 translation on the development branch so it won't be a position reported in future versions.

rs12777823 is what we consider an "extra" position. It's not actually used when matching defined haplotypes but it is a position that curators feel should be included when reporting genotypes. Since it's not used in a haplotype definition we don't have an "ALT" that we expect for that position, and that's what we use to determine what goes in the ALT for that file.

I hope that clarifies things a bit.

mathob commented 3 years ago

Yes, that all makes sense - thanks for taking thetime to reply, Ryan.

On 23 Apr 2021, at 3:47 am, Ryan Whaley @.**@.>> wrote:

No worries, this is actually a good question.

You may have already seen but rs4124874 has recenlty been removed from the UGT1A1 translation on the development branch so it won't be a position reported in future versions.

rs12777823 is what we consider an "extra" position. It's not actually used when matching defined haplotypes but it is a position that curators feel should be included when reporting genotypes. Since it's not used in a haplotype definition we don't have an "ALT" that we expect for that position, and that's what we use to determine what goes in the ALT for that file.

I hope that clarifies things a bit.

NOTICE Please consider the environment before printing this email. This message and any attachments are intended for the addressee named and may contain legally privileged/confidential/copyright information. If you are not the intended recipient, you should not read, use, disclose, copy or distribute this communication. If you have received this message in error please notify us at once by return email and then delete both messages. We accept no liability for the distribution of viruses or similar in electronic communications. This notice should not be removed.