Cloufield / gwaslab

A Python package for handling and visualizing GWAS summary statistics. https://cloufield.github.io/gwaslab/
GNU General Public License v3.0
151 stars 25 forks source link

R2 threshold #28

Closed chenyangjjj closed 1 year ago

chenyangjjj commented 1 year ago

Hi Yunye,

Thanks for providing such a concise package. I am extracting the lead SNPs with mysumstats.get_lead. What is the default R2 threshold inside this function?

Best, Chenyang

Cloufield commented 1 year ago

Hi, Chenyang, Thanks for your question. get_lead() is simply based on physical positions (sliding window). https://cloufield.github.io/gwaslab/ExtractLead/ LD was not taken into account for this function.

chenyangjjj commented 1 year ago

Thanks for your reply! It is a different strategy from plink. Would you mind sharing some insights about this new definition? Does it performs same or better than commonly used one. any literature would also welcome, much appreciated!

Cloufield commented 1 year ago

Hi, The aim of this function is simply to check significant loci and get lead variants (the variant with the lowest P value for each locus). It is different from the idea of clumping in PLINK. You can also check the definition in this paper. Zhou, W., Kanai, M., Wu, K. H. H., Rasheed, H., Tsuo, K., Hirbo, J. B., ... & Study, C. O. H. (2022). Global Biobank Meta-analysis Initiative: Powering genetic discovery across human disease. Cell Genomics, 2(10), 100192.

chenyangjjj commented 1 year ago

Thanks for clarifying it!