Closed nickhir closed 3 years ago
Hi,
I'm not sure why this happens, but N=92 is a tiny number compared to most GWAS. The fact that you got some SNPs with small p-values (p~1e-6) with such a small sample size, suggests that you have a SNP with a huge effect size. Also, using in-sample LD with N=92 and with so many SNPs probably leads to severe noise in the LD estimates.
If this is a European sample, I suggest to try using the UKB LD data that we published online, instead of in-sample LD estimates (see the wiki for details) . I'm not sure this will solve all problems, but it would be a step in the right direction.
The only other alternative I can think of is to significantly decrease the window size, so that you include only ~20 SNPs at least, around the most promising position. It's not ideal, and it's a form of searching under the lamplight, but I think it should help address some of the technical limitations in having a very small sample size.
Best,
Omer
I am using the precomputed prior causal probabilities from the UK Biobank to annotate my GWAS SNPs that originate from a purely European population. All SNPs overlap and the
extract_snpvar.py
runs without a problem. TheSNPVAR
values range from4.0733e-09
to4.0733e-07
.Afterwards I am using Susie to perform the actual fine mapping step using the following command.
The
full_dataset_2
is the dataset in PLINK format that I used to perform the initial GWAS.The
finemap_result
shows that both thePIP
and theCREDIBLE_SET
column contain only 0.Do you have some insights why this could be the case?
The significant SNPs for which I want to find the causal SNP are these:
To identify the causal SNPs I included all SNPs that are located within a 5MB window upstream and downstream of these variants , so I ended up with 21413 variants in total. Could the window size be a problem?
Or could the problem be, that the p values I got in my initial GWAS analysis are comparatively big? I only have 92 individuals for my study, so I thought that it is unlikely to see extreme p values such as 10^-20, so I just included the most significant SNPs.
Any help is much appreciated!