Closed filosi closed 1 year ago
It means there are too many non-zero correlations being stored. Are you using a 3cM window?
Yes indeed
And you have this issue with 200,000 variants?
With all the chromosomes with > 200k variants, yes.
Yes, you cannot use too many variants (e.g. 400K per chromosome); this would give you too many non-zero correlations. This has been discussed in several issues here.
What do you want to use it for? Prediction? Something else?
I just need to use in prediction mode for now.
Yes, so it is not recommended to use either LDpred2 or lassosum2 with so many variants for now. Please use either HapMap3 or HapMap3+ variants as recommended in the tutorial.
When computing a large LD_matrix I got into the warning message:
producing an invalid
dsCmatrix
withNA
values in thep
slots. This is produced by thesnp_cor
function, and I believe the stacktrace comes from thebigsnpr/R/corr.R:46
line of code.In my case it happened with matrix with more than 200000 rows and columns. After QC on the genotype data and gwas data the matching variants are > 6ml, and that's why a huge LD matrix should be computed.