Open jaamarks opened 4 years ago
OAall subset (UHS1 GWAS results) vs FOU subset (only UHS sample) Note that FOU with old UHS4 failed to complete.
FOU subset (UHS samples only with the newest UHS4) vs OAall all samples and OAall subset (deCODE contributing to OAexp)
Whole FOU vs OAall subset of UHS1 GWAS results
Note that COGA, Yale-Penn-CIDR, Yale-Penn-GO, and deCODE_OAexp did not finish due to errors during the correlation/heritability analyses. See logs below fou_087_by_coga.ldsc_regression.log fou_087_by_decode_oaexp.ldsc_regression.log fou_087_by_yalepenn_cidr.ldsc_regression.log fou_087_by_yalepenn_go.ldsc_regression.log
Remove VIDUS from both FOU and OAall
reference trait: FOU_097 (N=5,088) (no VIDUS)
compared to: OAall_098 (N=302,330) (OAall no VIDUS)
OAall_099 (N=29,443) (OAall no VIDUS and deCODE OAexp subset)
VIDUS FOU (N=300) vs VIDUS OAall (N=2177)
I tried to run FOU UHS1–4 (with old UHS4) compared to OAall UHS1, but again it failed with the same error. See below for N information.
Heritability of phenotype 1
---------------------------
Total Observed scale h2: -0.0466 (0.1796)
Lambda GC: 1.0345
Mean Chi^2: 1.0281
Intercept: 1.0304 (0.0062)
Ratio: 1.0828 (0.2207)
FOU_old_uhs4 -- UHS1(897) + UHS2-3(772) + UHS4(861) = 2,530 OAall -- UHS1(9,245)
○ 009: FOU_092 (without UHS4) vs FOU_087 and Oaall_UHS1 GWAS results and COGA
COGA continuing to failed
011: CATS_FOU (N=1226) vs CATSPERTHUNT_OAall (N=1920) & CATSMOLE_OAall (N=3162)
012: CATSMOLE_OAall (N=3162) vs CATS_FOU (N=1226) & CATSPERTHUNT_OAall (N=1920)
013: CATSPERTHUNT_OAall (N=1920) vs CATS_FOU (N=1226) & CATSMOLE_OAall (N=3162)
011 | 012 | 013 |
---|---|---|
We see from the attached spreadsheet that order does not matter in the LDSC regression pipeline, as expected. In particular:
The correlation between traits stays the same even when the reference trait and non-reference trait are switched. In particular:
20200211_cats_fou_vs_oaall.xlsx
cats_fou_by_catsmole.ldsc_regression.log catsmole_oaall_by_cats_fou.ldsc_regression.log catsperthunt_by_cats_fou.ldsc_regression.log
Yale-Penn FOU (N=850) vs Yale-Penn-CIDR (N=666) & Yale-Penn-GO (N=917)
Failed to run.
Heritability of phenotype 1 (FOU reference)
---------------------------
Total Observed scale h2: -0.934 (0.4784)
Lambda GC: 1.0046
Mean Chi^2: 1.0054
Intercept: 1.0207 (0.006)
Ratio: 3.821 (1.103)
Heritability of phenotype 2/3 (YP-GO)
-----------------------------
Total Observed scale h2: 0.0442 (0.5186)
Lambda GC: 1.0315
Mean Chi^2: 1.0323
Intercept: 1.0316 (0.0068)
Ratio: 0.9764 (0.2091)
Heritability of phenotype 3/3 (YP-CIDR)
-----------------------------
Total Observed scale h2: -0.398 (0.6178)
Lambda GC: 1.0135
Mean Chi^2: 1.0078
Intercept: 1.0129 (0.0061)
Ratio: 1.654 (0.7842)
Complete FOU_087(N=5,388) compared to OAall GWAS results:
Heritability of phenotype COGA
-----------------------------
Total Observed scale h2: -0.2964 (0.0533)
Lambda GC: 0.9898
Mean Chi^2: 0.9887
Intercept: 1.0337 (0.0064)
Ratio: NA (mean chi^2 < 1)
deCODE_oaexp
FloatingPointError: invalid value encountered in sqrt
Heritability of phenotype yale-penn_cidr
-----------------------------
Total Observed scale h2: -0.4177 (0.6015)
Lambda GC: 1.0135
Mean Chi^2: 1.0072
Intercept: 1.0126 (0.0059)
Ratio: 1.7479 (0.8206)
Yale-Penn_GO
FloatingPointError: invalid value encountered in sqrt
For this analysis we compared the OAall cohorts by running a PCA of the union set of top SNPs from each cohort. In particular, we extracted the top SNPs (with P<10E-4 ) from each OAall cohort and then took the union set of those SNPs with which we performed a PCA. Note that we removed any SNPs from the union set that were not present in all cohorts. The OAall cohorts were:
deCODE OAall and deCODE OAexp were the exact same in this model. This is because the MAFs were the exact same for both phenotypes, despite the difference in sample size. This should be noted because it will affect which list of SNPs to filter out due to the MAF threshold. These data were retrieved from:
OAall: s3://rti-midas-data/studies/ngc/decode/association_tests/001/ea/oaall/decode.ea.oaall.chr{1..22}.1000g_ids.maf_gt_0.01_eur_decode.beta_se.rsq_gt_0.3.gz
OAexp: s3://rti-midas-data/studies/ngc/decode/association_tests/002/ea/oaexp/decode.ea.oaexp.chr{1..22}.1000g_ids.maf_gt_0.01_eur_decode.beta_se.rsq_gt_0.3.gz
Raymond Walters described here that lack of polygenic signal can be the culprit for floating point errors.
that error indicates that there are negative estimates of heritability observed for one of the traits when computing the jackknife standard errors, which normally this indicates that at least one of the input results has very low heritability
For comparison, LDHub restrict to looking at genetic correlation in GWAS with heritability Z-score (h2g / SE) above 2, and the original Nature Genetics paper restricted to Z-score > 4, compared to .0746/.0656 = 1.14 for your results.
Also see this GitHub issue
This generally means the heritability of at least one of the traits is quite small and/or unstable (e.g. due to small sample size
it’s possible that there simply isn’t enough signal in your data for a reliable genetic correlation analysis with ldsc (which would be consistent with the error message you are seeing).
We normally recommend against LDSR genetic correlation analyses when the univariate h2 results for either phenotype isn't clearly significant (e.g. z-score > 4).
COGA Total Observed scale h2: -0.2965 (0.0534) Mean Chi^2: 0.9887
deCODE_OAexp Total Observed scale h2: 0.0385 (0.1841) Mean Chi^2: 0.9875
Yale-Penn CIDR Total Observed scale h2: -0.4239 (0.6054) Mean Chi^2: 1.0072
Yale-Penn GO
Total Observed scale h2: 0.1106 (0.5307)
Mean Chi^2: 1.0319
Q. Why is my total h2 estimate negative?
A. Negative h2 is of course not meaningful, but negative h2 estimates can occur. This usually means that the true h2 is close to zero, and sampling error pushed the estimate below zero.
Check whether mean chi2 is above ~1.02. If no, this means there is very little polygenic signal for LDSC to work with.
FOU meta106 (N=4,592) vs OAall meta105 (N=18,825) Meta-analyses excluding the cohorts which ran LMM.
Heritability of phenotype 1
---------------------------
Total Observed scale h2: 0.0875 (0.1109)
Mean Chi^2: 1.0263
Heritability of phenotype 2/2
-----------------------------
Total Observed scale h2: 0.1451 (0.0221)
Mean Chi^2: 1.0299
Note 0.0875/0.1109 = 0.735 which is indicates that the heritability estimate Z-score is too low.
LDHub restrict to looking at genetic correlation in GWAS with heritability Z-score (h2g / SE) above 2, and the original Nature Genetics paper restricted to Z-score > 4
As a rule of thumb, LD Score regression tends to yield very noisy results when applied to datasets with fewer than ~5k samples
Same as 017 except compare to individual level OAall GWAS results.
VIDUS_FOU (N=300) vs OAall_meta individual cohorts.
VIDUS_OAall meta (N=2177) vs FOU GWAS of individual cohorts
We next plan to combine the UHS2–3 & UHS4 data and perform an FOU GWAS. We will then perform a heritability estimate (hg2 ) on those results using the LDSR tool, and also using GREML (as implemented in GCTA). One caveat is that GREML requires the raw genotype data of each individual—so we can't use summary stats.
VIDUS OAall (N=2177) UHS2-4 (N=1562)
UHS2-4 had a negative heritability, even though when ran separately they did not. You see that UHS2-3 had very low h2 with large se and same for UHS4.
cohort | N | h2 | se | zscore |
---|---|---|---|---|
uhs2-3 | 772 | 0.03 | (0.6457) | 0.0464 |
uhs4 | 1067 | 0.2717 | (0.386) | 0.703 |
Description: The frequency of use (FOU) phenotype for NGC continues to be flipped in the opposite direction from the other NGC phenotypes (OAall and OAexp). See figures in GitHub issue 140 in the December 16, 2019 comment. We are going to investigate this issue by performing an LDSC regression on a few different combinations of phenotypes.
[x] 001: OAall_UHS1 GWAS results (N=9,245) vs FOU_old_UHS4 (N=2,530) and FOU_new_UHS4 (N=2,736)
[x] 002: FOU_new_UHS4 (N=2,736) vs OAall_094 (N=31,620) and OAall_089 (N=304,507)
[x] 003: OAall_UHS1 GWAS results (N=9,245) vs FOU_087 (N=5,388)
Date Locations: NGC summary stats results location:
Note: The effect allele for the FOU analyses is the same as for the other NGC metas (A2). The effect allele for the OAall GWAS analysis is ALT.
See also
https://s3.console.aws.amazon.com/s3/buckets/rti-heroin/ldsc/data/