Open laleoarrow opened 2 months ago
The sample size is relevant to the z-scores, i.e., how many samples were used to compute the association summary statistics. It should have nothing to do with the sample size of the LD reference panel.
Correct, N is the sample size for the association statistics (e.g., z-scores).
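To make the point above concrete, here is a minimal sketch (not taken from coloc or susieR) using the standard identity for a simple-regression test statistic, r = z / sqrt(N - 2 + z^2), which recovers the marginal correlation between a variant and the trait from its z-score. The function name `z_to_correlation` and the sample sizes are hypothetical, chosen only for illustration. The z-score was produced by the GWAS, so plugging in the reference panel size instead of the GWAS N changes the implied effect considerably.

```python
import math


def z_to_correlation(z: float, n: int) -> float:
    """Marginal correlation implied by a simple-regression statistic z
    computed from n samples: r = z / sqrt(n - 2 + z^2)."""
    if n <= 2:
        raise ValueError("n must be greater than 2")
    return z / math.sqrt(n - 2 + z * z)


# Hypothetical numbers, for illustration only.
z = 5.0
gwas_n = 50_000  # assumed number of samples behind the association statistics
panel_n = 500    # assumed size of an LD reference panel

r_gwas = z_to_correlation(z, gwas_n)
r_panel = z_to_correlation(z, panel_n)

print(f"implied r with GWAS N:  {r_gwas:.4f} (r^2 = {r_gwas ** 2:.5f})")
print(f"implied r with panel N: {r_panel:.4f} (r^2 = {r_panel ** 2:.5f})")
```

With the same z-score, the smaller N implies a much larger correlation, which is why the N passed alongside z-scores should be the one used to compute them.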
@leoarrow1 If the documentation is unclear in coloc, please post an issue on the coloc GitHub, and feel free to reference this discussion here.
Thanks for the confirmation.
Hi all, I found the `n` in the source code of the coloc package’s susie function, where there is a section that handles the `n` parameter for sample size.
If I'm getting this right, the `n=samplesize` (for GWAS summary data) parameter is only applicable when an in-sample LD matrix is used. When the LD matrix is inferred from a reference panel, such as the 1kg panel, `n` should instead represent the sample size of the reference panel. For instance, if the LD matrix is calculated from the European subset of the 1kg reference individual data, should `n` be approximately 500?