PharmGKB / PharmCAT

The Pharmacogenomic Clinical Annotation Tool
Mozilla Public License 2.0

Calling CYP2D6 as *4/*4 instead of *4+*10/*4+*10 #189

Closed msilver727 closed 1 month ago

msilver727 commented 1 month ago
BinglanLi commented 1 month ago

Thank you for reporting the issue. We are investigating it.

BinglanLi commented 1 month ago

Based on your input VCF, your sample is indeed *4/*4. You are right that rs3892097 is the core allele-defining variant for *4. However, rs1135840 and rs1065852 are also allele-defining positions for all *4 sub-alleles but one, and because of that exceptional *4 sub-allele, rs1135840 and rs1065852 are not included in the PharmVar *4 core (see the PharmVar CYP2D6 gene page and the PharmVar criteria page). The CPIC/PharmGKB CYP2D6 allele definition table is based on PharmVar core alleles. In addition, a variant that is part of the core allele of one star allele but is present in some, not all, suballeles of another star allele is included in that other star allele as an ambiguous change, using the IUPAC nucleotide code (see the Notes tab on the CYP2D6 Allele Definition Table). Currently, PharmVar defines 175 CYP2D6 alleles. rs1135840 and rs1065852 are part of 21 CYP2D6 core star alleles and are further included in suballeles of 3 star alleles (*4, *56, and *150).

Therefore, CYP2D6 *10 is defined by the presence of rs1135840 and rs1065852 and the absence of variants defining other star alleles.
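The rule above can be illustrated with a small Python sketch. This is a toy model, not PharmCAT's actual matcher: it assumes only two alleles (*4 and *10), treats *1 as the reference allele, takes already-phased haplotypes as input, and uses hypothetical names (`ALLELES`, `call_haplotype`, `call_diplotype`). The ambiguous (IUPAC-coded) positions in *4 are modeled as "optional" variants that may be present or absent.

```python
# Toy allele definitions (illustrative only, not the real definition table).
# "required": core variants that must be present.
# "optional": ambiguous positions that may or may not be present.
ALLELES = {
    "*4": {"required": {"rs3892097"},
           "optional": {"rs1135840", "rs1065852"}},
    "*10": {"required": {"rs1135840", "rs1065852"},
            "optional": set()},
}
REFERENCE = "*1"  # assumption: no variants observed means reference


def call_haplotype(observed):
    """Return the names of all toy alleles consistent with one haplotype."""
    observed = set(observed)
    if not observed:
        return [REFERENCE]
    matches = []
    for name, definition in ALLELES.items():
        allowed = definition["required"] | definition["optional"]
        # All core variants must be present, and no observed variant
        # may fall outside what this allele permits.
        if definition["required"] <= observed and observed <= allowed:
            matches.append(name)
    return matches


def call_diplotype(hap1, hap2):
    """Join the first match of each haplotype; 'unknown' if none matches."""
    calls = [(call_haplotype(h) or ["unknown"])[0] for h in (hap1, hap2)]
    return "/".join(calls)


if __name__ == "__main__":
    with_core = {"rs3892097", "rs1135840", "rs1065852"}
    without_core = {"rs1135840", "rs1065852"}
    print(call_diplotype(with_core, with_core))
    print(call_diplotype(without_core, without_core))
```

In this sketch, a haplotype carrying rs3892097 together with rs1135840 and rs1065852 matches only *4, because *10 does not permit rs3892097. The same two variants without rs3892097 match only *10.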

For more on how PharmCAT determines a PGx allele, please visit https://pharmcat.org/methods/NamedAlleleMatcher-101/.

For more on calling CYP2D6 from VCF, including its limitations (e.g., structural and copy number variation have a large influence on the inferred CYP2D6 phenotype but are beyond the scope of what can be called from SNPs or INDELs in a VCF file), please visit https://pharmcat.org/using/Calling-CYP2D6/.