Open adrianodemarino opened 1 year ago
So the error message is caused by a small bug which I will fix soon but the actual reason for crashing is that gemma seems to be unable to complete the creation of the relatedness matrix when using your full dataset. Did you try different subsets of your VCF file to see if it is only a certain subgroup of SNPs that is causing the issue?
I didn't try with a different subgroup of SNPs. I only sub grouped the number of samples used (in one case 10 samples and in another one 5000 samples) and in both cases it worked perfectly. Could be that due to the amount of samples (~500k), the relatedness matrix wants more than only 15724 SNPs ?
I never read about that being an issue, maybe you could subset your sample size further to get an upper limit of how many samples gemma is able to use?
I will try to do that and let you know, thanks!
I get this error when my VCF file it contains a lot of samples.
command executed:
vcf2gwas -v chr20.Haplotypes.vcf.gz -pf height_data_ukbb_participant.csv -p height -cf covariants_nokinship_data_ukbb_participant.csv -c sex -lmm
run:
If I try to run the same exactly command using only a subset of samples, it works perfectly. I get the same error if instead of using option
-c sex
I use-ac