Yves-CHEN / DENTIST

DENTIST (Detecting Errors iN analyses of summary staTISTics) is a QC tool for summary-data-based analyses.
GNU Lesser General Public License v3.0

Providing more information about the log #22

Open yanyul233 opened 1 year ago

yanyul233 commented 1 year ago

Hi! Thanks for developing DENTIST. While using the software and looking into some interesting examples, we found that it would be great if the log file were more readable. For instance, the log below contains some numbers, and I'm keen to understand what they are. Thanks!

[warning] mogan is like to be missing, but it will only be a problem when calling for it.
[info] Calculating frequencies with 1 cpus
[info] This bed file is plink 1.0 bedfile format. (Older)
[info] Guessing the chrID.
[info] chrID == 18
[info] Aligning GWAS to the reference sample assumming both files are ordered.
[info] Performing DENTIST at 8512 SNPs shared between the summary and reference data.
2537-8512, 1275544, 0
[info]  At least 289 Mb of memory is required.
2537th - 8512th
..0.0%[info]This bed file is plink 1.0 bedfile format. (Older)
[info] Buffer size is 25 Mb.
0.0160886 0.00153092 0.0299531
sum: 620
sum: 1622
sum: 1236
sum: 3246
sum: 574
sum: 1628
sum: 1156
sum: 3246
sum: 554
sum: 1636
sum: 1136
sum: 3243
sum: 563
sum: 1617
sum: 1118
sum: 3240
sum: 552
sum: 1616
sum: 1096
sum: 3239
sum: 542
sum: 1615
sum: 1078
sum: 3235
sum: 546
sum: 1599
sum: 1056
sum: 3232
sum: 515
sum: 1620
sum: 1036
sum: 3232
Yves-CHEN commented 1 year ago

Hi, thanks very much for your interest and your important feedback.

The "sum:" output is not intended for users. It has been removed from the new version of DENTIST (please try out our new version, which was published in October 2023).

For the interpretation of the log, please go to the README of DENTIST.

Let me know if anything is missing from the instructions.
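For anyone who still has logs from an older version, a small filter can hide the "sum:" lines. This is only a sketch and not part of DENTIST: the function name `strip_debug_lines` is made up for illustration, and it assumes the debug lines look exactly like `sum: 620`, as in the log posted above.

```python
import re

# Assumption: internal debug lines have the form "sum: <integer>",
# matching the excerpt posted in this issue.
DEBUG_LINE = re.compile(r"^sum:\s*\d+\s*$")


def strip_debug_lines(log_text: str) -> str:
    """Return the log text with the "sum:" debug lines removed."""
    kept = [line for line in log_text.splitlines() if not DEBUG_LINE.match(line)]
    return "\n".join(kept)


if __name__ == "__main__":
    # Lines copied from the log in this issue.
    sample = (
        "[info] Buffer size is 25 Mb.\n"
        "0.0160886 0.00153092 0.0299531\n"
        "sum: 620\n"
        "sum: 1622"
    )
    print(strip_debug_lines(sample))
```

Only lines that consist entirely of `sum:` followed by an integer are dropped, so `[info]` and `[warning]` lines pass through unchanged.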

yanyul233 commented 1 year ago

Hi @Yves-CHEN, thanks for the updates! We've tried the new release.

Another puzzling point is what 2537th - 8512th means in the example above. More generally, we noticed that DENTIST returns results only for the reported chunk, such as 2537th - 8512th, and the rest of the variants are discarded somewhat silently. Thank you!
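To make the silent discarding concrete, this is roughly how we check which variants are missing from the results. It is a minimal sketch that does not call DENTIST: `find_dropped_variants` is a hypothetical helper, the variant IDs are made up, and it assumes the IDs from the summary data and from the DENTIST output have already been read into Python lists.

```python
def find_dropped_variants(input_ids, output_ids):
    """Return the input variant IDs that are absent from the output, in input order."""
    reported = set(output_ids)  # set lookup keeps the check fast for large inputs
    return [variant for variant in input_ids if variant not in reported]


if __name__ == "__main__":
    # Hypothetical IDs for illustration only.
    summary_ids = ["rs1", "rs2", "rs3", "rs4"]
    result_ids = ["rs3", "rs4"]
    dropped = find_dropped_variants(summary_ids, result_ids)
    print(f"{len(dropped)} of {len(summary_ids)} variants have no result: {dropped}")
```

A count like this printed in the log itself would make it much clearer when variants outside the reported chunk are skipped.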