Open aneilbaboo opened 6 years ago
Hi @aneilbaboo
I grabbed a pair of input (23&Me) and output (VCF) from the S3 buckets that my test make targets populate. I randomly spot-checked a number of the categories above, comparing not just the file entries against one another looking for conversion errors, but also checked against dbSNP to make sure that chromosome, position, rsID, gene annotation, and reference base information looks correct.
I didn't find any errors. The only oddity that I found is that Beagle seems to output an extra line for X chromosome variants when the subject is male. I'll debug that issue this week, along with adding some high-throughput sanity checking of the entire input & output files.
@taltman thanks for the update. So it's duplicating male X-linked variants?
Create one or more input 23andMe file which encodes several specific situations.
Test corresponding rows of the imputed VCF:
Expected cases:
Unexpected cases: