PharmGKB / PharmCAT

The Pharmacogenomic Clinical Annotation Tool
Mozilla Public License 2.0

Duplicated CYP2D6 diplotypes in matcher JSONs #184

Open BinglanLi opened 3 months ago

BinglanLi commented 3 months ago

bug

For some test samples, the Matcher JSON contains duplicate CYP2D6 diplotypes when run in -research cyp2d6,combinations mode.

The duplicates appear under CPIC-CYP2D6-diplotypes. The attached mock VCF produces a Matcher JSON that lists *1/[*3 + *4 + *122] twice.

This can be reproduced by running:

docker run --rm -v ./:/pharmcat/data pgkb/pharmcat pharmcat -vcf data/mock_test.vcf -research combinations,cyp2d6

Expected behavior: only one unique diplotype should be reported.

I use the number of unique diplotypes in the Matcher JSON to determine which fields of the Phenotyper JSON to process.
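As a stopgap on the consumer side, the duplicate check and the unique count could be sketched roughly as below. This is only an illustration: it assumes the diplotype names have already been pulled out of the Matcher JSON into a plain list of strings (the extraction step is left out because the exact JSON layout is not shown here), and the helper names `find_duplicate_diplotypes` and `unique_diplotypes` are hypothetical, not part of PharmCAT.

```python
from collections import Counter


def find_duplicate_diplotypes(names):
    """Return {diplotype_name: count} for every name that appears more than once."""
    counts = Counter(names)
    return {name: n for name, n in counts.items() if n > 1}


def unique_diplotypes(names):
    """Return the names with exact duplicates removed, keeping first-seen order."""
    return list(dict.fromkeys(names))


# Assumed input: diplotype names already extracted from the Matcher JSON.
# The values mirror the duplicated call described in this issue.
names = ["*1/[*3 + *4 + *122]", "*1/[*3 + *4 + *122]"]

duplicates = find_duplicate_diplotypes(names)
unique = unique_diplotypes(names)

print("duplicated:", duplicates)
print("unique count:", len(unique))
```

This only compares names as exact strings, so it would not catch two entries that describe the same diplotype with different formatting or allele order.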