Open kimjr89 opened 4 years ago
Hi @kimjr89,
Interesting result - I have no immediate explanation for this.
This is DNA sequencing data from normal tissue, i.e. e.g. not from tumor tissue?
Regarding the column names: ClusterID = Diploid G group allele cluster P = normalized posterior probability LL = log likelihood (which P is calculated from) Mismatches_avg = A measure of the read mismatches observed for the allele group cluster. Smaller is better in principle, but unclear whether this metric is really informative.
Hi.
I've got discordant results from different tissues of the same person using HLA-LA. The results are all the same except in DPA1 region: one is DPA102:02:02 and the other is DPA101:11. Though average coverage is lowest in DPA1 region than other regions, it's above 30X.
I attached the result files. AA1_P3_R1_bestguess_G.txt AA1_BM_R1_bestguess_G.txt
I investigated the intermediate files and found that there are summary stats prioritizing likely pairs in "R1_pileup_DPA1.txt" file.
I also attached "R1_pileup_DPA1.txt" files. AA1_BM_R1_PP_DPA1_pairs.txt AA1_P3_R1_PP_DPA1_pairs.txt
Can you explain the meaning of the column names (ClusterID, P, LL, and Mismatches_avg) and possible answers for the discordant results?
Thank you!