PankratzLab / DonorCheck

Utility for validating donor typing entries to DonorNet
2 stars 0 forks source link

Add support for LinkSeq XML #4

Open hinerm opened 5 years ago

hinerm commented 5 years ago

Referenced here.

XML Sections to parse

It looks like the XML has an labAssignmentSection block at the top which includes the assigned typings. These can be read and passed to a ValidationModelBuilder to populate the core typing data.

For haplotype information, there is a testResultsSection that has all the possible alleles for each locus, which will be used to populate haplotype information.

Implementation Notes

Goals

hinerm commented 5 years ago
hinerm commented 5 years ago

The XML hierarchy of the haplotype allele storage for a single locus is:

The hlaCallList can contain multiple hlaTestCalls:

image

When there are multiple entries like this, we should look for a this was MANUALLY chosen flag in the footnote list:

image

and that is the hlaTestCall we should pull allele information from.

If a testCallList contains multiple hlaTestCalls and none have that footnote, just skip the check including that locus (e.g. if there are multiple C calls and none were manually selected, we wouldn't compute any B-C haplotype probabilities)