Closed NathanSkene closed 2 years ago
Reduced from 11 million --> 400k SNPs. Much faster to download and still produces results with MAGMA. Filtered by MAF @ 5% and a nominal p-value of .05.
path_formatted <- MAGMA.Celltyping::get_example_gwas(
trait = "fluid_intelligence",
munged = TRUE)
ss <- data.table::fread(path_formatted)
ss2 <- ss[MINOR_AF>=.05 & P<.05,]
I've now filtered both examples ("fluid_intelligence" and "prospective_memory"), and they can be accessed via the MAGMA.Celltyping::get_example_gwas
function.
They're both now <40Mb (close enough!)
Could try filtering by frequency?
Reducing to HapMap3 SNPs.... I think thats what makes LDSC munged sumstats files so small
I'd try doing those and check if results look similar