--fastGWA-mlm-binary: segmentation fault at sparse grm reading step

jianyangqt / gcta

GCTA software

GNU General Public License v3.0


Hi, do you mean that there are ~5M lines in your .grm.sp file? That is not expected for a 26k dataset. Even for the UK Biobank, by our calculation the .grm.sp file has only ~600k lines (restricted to European-ancestry participants).

Could you please check the following to make sure the sparse GRM was calculated correctly?

Are all the individuals of the same or similar genetic ancestry? (Individuals of a different ancestry, or admixed individuals, should be removed.)
Are you using HapMap3 common SNPs (minor allele frequency >= 0.01) to generate the GRM? (Rare SNPs should not be used; also make sure the HapMap3 SNP list is from the ancestry corresponding to your data.)
Are you using a sparse GRM cutoff of 0.05? (Setting the cutoff to a value lower than 0.05 will increase the number of related pairs, i.e. rows, in your .grm.sp file.)
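
If it helps with the checks above, here is a rough Python sketch for summarising a .grm.sp file: how many lines it has, how many are self-pairs, and how many off-diagonal pairs fall below the cutoff you intended. It is not part of GCTA. It assumes each line holds three whitespace-separated fields (index of individual 1, index of individual 2, relatedness value) and that self-pairs are stored as lines where the two indices are equal; please verify both against your own file. The function name `summarize_sparse_grm` and the sample values are made up for illustration.

```python
from typing import Dict, Iterable


def summarize_sparse_grm(lines: Iterable[str], cutoff: float = 0.05) -> Dict[str, int]:
    """Count entries in a sparse GRM given as text lines.

    Assumed line layout (check your file): index_1 index_2 relatedness
    """
    counts = {"total": 0, "diagonal": 0, "off_diagonal": 0, "below_cutoff": 0}
    for line in lines:
        fields = line.split()
        if len(fields) < 3:
            continue  # skip blank or malformed lines
        first, second, value = fields[0], fields[1], float(fields[2])
        counts["total"] += 1
        if first == second:
            counts["diagonal"] += 1  # self-pair
        else:
            counts["off_diagonal"] += 1
            if value < cutoff:
                # pair that a sparse GRM built with this cutoff should not contain
                counts["below_cutoff"] += 1
    return counts


if __name__ == "__main__":
    # Hypothetical sample content, not real data.
    sample = ["0\t0\t1.01", "1\t0\t0.26", "1\t1\t0.99", "2\t1\t0.03", "2\t2\t1.00"]
    print(summarize_sparse_grm(sample))
```

To run it on a real file, pass an open file handle instead of the sample list, e.g. `summarize_sparse_grm(open("your_prefix.grm.sp"))`, where `your_prefix` stands for your own output prefix. A large `below_cutoff` count would suggest the GRM was built with a cutoff lower than 0.05, and an `off_diagonal` count in the millions for a 26k dataset would point to the ancestry or SNP-selection issues listed above.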


--fastGWA-mlm-binary: segmentation fault at sparse grm reading step #79