Closed yystat closed 3 years ago
Hi,
Could you check by testing on the odd looking chromosomes (chr3,12&20) whether you observe the same behavior with v2.0.2? You can use --chrList 3,12,20
to filter the analysis down to these chromosomes.
Thanks, I tried the newer version (2.2.2) and now the results look normal.
I really like Regenie since it's so fast. Empirically, it seems that it's less powerful than BOLT-LMM for continuous traits, right? Is there a way to boost the power? For binary traits, I compared Regenie vs. Plink, Regenie also seems to be less powerful at the known causal loci.
Thank you for your comments in advance.
The model used in Step.1 of Regenie is closely related to the infinitesimal model used in many GWAS methods to capture genome-wide polygenic effects and thus can obtain lower power for traits with sparser genetic architecture, for which a model which models sparsity would be better (such as the one in BOLT-LMM). This is something we are looking into. When comparing to PLINK, did you subset to unrelateds and use PCs in the model (as PLINK does not have a model to capture relatedness which can lead to inflation if unaccounted for)?
Thanks, that makes senses.
For binary traits, all my samples are unrelated but I also included PCs in covariates (for both Plink and Regenie). I think since Regenie can account for relatedness without PCs, adding PCs may adjust the population stratification too much and reduce the power (Plink uses PC vs. Regenie uses ridge regression + PC making the comparison unfair)?
There were bugs with v2.2-v2.2.3 with binary traits which were addressed on v2.2.4. That should explain the strange results observed with v2.2. It is standard practice for GWAS to include PCs as covariates and we also recommend that for Regenie; how different are the p-values between the two approaches (like differences in many orders of magnitude?)
Hello,
I'm using regenie (v2.2) on a binary trait (about 3K cases vs. 12K controls) and I got a very strange result. See the attached Manhattan plot below. It seems that all the SNPs from chromosome 3, 12, and 20 have p-values larger a fixed value (which seems to be 0.05). I also check the number of SNPs, these chromosomes do not seem to have unusually small number of SNPs.
Note, I also run the case-control GWAS using a subset of the case (so several hundreds cases vs. about 12K controls) and the results look normal.
Thank you very much for your help!
Here is a number of SNPs in each chromosome:
Here is the distribution of the P values from chromosomes 3, 12, and 20. Indeed the SNPs have a minimum P value of 0.05:
Here is my codes: