odelaneau / GLIMPSE

Low Coverage Calling of Genotypes
MIT License
139 stars 26 forks source link

Ask about header of `*rsquare.grp.txt.gz`, output of GLIMPSE2 concordance #235

Open Truongphikt opened 2 weeks ago

Truongphikt commented 2 weeks ago

Hi GLIMPSE2 team,

After processing the gettingstarted tutorial and reading the concordance_plot.py script, I realized that the 2nd and 4th columns of the* rsquare.grp.txt.gz file (the output of GLIMPSE2 concordance) are MAF bins and aggregated $R^2$, respectively. However, what are the headers of the 1st and third columns? They are the number of variants and NRC, aren't they? I tried to find them in the document but didn't find the header of the concordance's output.

Head of *rsquare.grp.txt.gz file

0 650586008 0.00323046 0.62754 0.668743
1 56607584 0.0236651 0.848334 0.869651
2 26213611 0.0410262 0.912369 0.927101
3 17361908 0.057932 0.932151 0.944779
4 13492571 0.0745543 0.944295 0.954775
5 10646295 0.0913464 0.945118 0.955798

Relate to the *rsquare.grp.txt.gz. I also wonder what is the header of the *rsquare.spl.txt.gz file.

Head of *rsquare.spl.txt.gz file

HG01589 0.980067 0.983804
HG01583 0.980381 0.984055
HG02784 0.983256 0.98649
HG02789 0.984328 0.987049
HG02493 0.981906 0.985343
HG02688 0.983168 0.986356
HG02733 0.981634 0.985403

Thanks.

Truongphikt commented 1 week ago

After referred #109, I had found some information: