stephenslab / mashr

An R package for multivariate adaptive shrinkage.
https://stephenslab.github.io/mashr

Question about the methodology for selecting strong datasets #116

Open harrsha4 opened 1 year ago

harrsha4 commented 1 year ago

Hello, just as a follow-up to this thread: when you say the strong set was the lead eQTL for 20,000, do you mean within a tissue or across all tissues, given that you were training on GTEx data? I am currently working with FastQTL results from 5 conditions (5 conditions x 33 million eQTL features). For the strong dataset I used all lead SNP-eGene pairs from each condition (13,000 x 5 = 65,000), and for the random dataset I used a random subset of 1 million.

Thanks for creating this resource!

Harrsha

pcarbo commented 1 year ago

@harrsha4 In this vignette we suggested one possibility: "Select the strong signals as those with lfsr < 0.05 in any condition in the 1by1 analysis."
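
To make the "in any condition" rule concrete, here is a minimal sketch in plain Python rather than R. It assumes you already have a table of per-condition lfsr values from a condition-by-condition analysis. The function name `select_strong` and the toy numbers are hypothetical, not part of mashr. In mashr itself, the vignette does the equivalent with `mash_1by1` followed by `get_significant_results`.

```python
def select_strong(lfsr, threshold=0.05):
    """Return indices of rows with lfsr < threshold in at least one condition.

    `lfsr` is a list of rows, one per gene-SNP pair, with one value per condition.
    """
    # A row qualifies when its smallest lfsr across conditions is below the cutoff.
    return [i for i, row in enumerate(lfsr) if min(row) < threshold]


# Hypothetical lfsr values: 4 gene-SNP pairs (rows) x 3 conditions (columns).
lfsr = [
    [0.40, 0.30, 0.60],  # not significant anywhere
    [0.01, 0.50, 0.70],  # significant in condition 1 only
    [0.20, 0.04, 0.03],  # significant in conditions 2 and 3
    [0.05, 0.90, 0.80],  # exactly at the cutoff, so not selected
]

strong = select_strong(lfsr)
print(strong)  # [1, 2]
```

Under this rule the selection is pooled across conditions: a row that is significant in several conditions is still selected once, so the strong set is a set of rows with values for every condition, not a stack of per-condition lists.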