bodkan / admixr

An R package for reproducible and automated ADMIXTOOLS analyses
https://bodkan.net/admixr

How does admixr calculate stats for populations (multiple individuals belonging to same group)? #71

Closed by jychoilab 4 years ago

jychoilab commented 4 years ago

Hi

Sorry for such a basic question, but could you clarify how, for instance, d() calculates D-statistics when the .ind file contains populations (groups of individuals sharing the same population label) rather than a single individual per population label?

Thank you

bodkan commented 4 years ago

Hi,

f-statistics (and, by extension, D-statistics) work with allele frequencies, so there is really no difference in how the statistics are calculated in these two cases. A single individual can be understood simply as a population of size one, with possible allele frequencies of 0, 0.5 or 1.
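To make this concrete, here is a minimal Python sketch of the idea. It is not the ADMIXTOOLS or admixr code: the function names `allele_freq` and `d_statistic` and the toy genotypes are made up for illustration. It assumes diploid genotypes coded as 0, 1 or 2 copies of one allele, no missing data, and the usual ratio form of D over SNPs, and it leaves out the block jackknife used for standard errors.

```python
def allele_freq(genotypes):
    """Allele frequency at one SNP from diploid genotypes (0, 1 or 2 copies)."""
    return sum(genotypes) / (2 * len(genotypes))


def d_statistic(w, x, y, z):
    """D(W, X; Y, Z) from per-SNP allele frequencies of four populations.

    Illustrative ratio form: the sum of (w - x)(y - z) over SNPs, divided by
    the sum of (w + x - 2wx)(y + z - 2yz).
    """
    num = sum((a - b) * (c - d) for a, b, c, d in zip(w, x, y, z))
    den = sum((a + b - 2 * a * b) * (c + d - 2 * c * d)
              for a, b, c, d in zip(w, x, y, z))
    if den == 0:
        raise ValueError("no informative SNPs: denominator is zero")
    return num / den


# Hypothetical toy data: one inner list per SNP, one genotype per individual.
pop_w = [[2, 1, 2], [0, 1, 0], [1, 1, 2]]  # group of three individuals
pop_x = [[1], [0], [2]]                    # a single individual
pop_y = [[2, 2], [0, 0], [1, 2]]           # group of two individuals
pop_z = [[0], [1], [1]]                    # a single individual

# Groups and single individuals go through exactly the same frequency step.
freqs = [[allele_freq(snp) for snp in pop]
         for pop in (pop_w, pop_x, pop_y, pop_z)]

print(freqs[1])  # frequencies of the single individual: 0.5, 0.0, 1.0
print(d_statistic(*freqs))
```

The only place where group size enters is `allele_freq`. Once every population is reduced to a vector of frequencies, the statistic itself does not know or care how many individuals stood behind each value.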

For details about the theoretical properties of these statistics and how they are calculated, I recommend reading the papers by Patterson et al. and Peter.

I will close this issue now because this is not really a bug report. If you'd like to discuss the theoretical properties of f-statistics, I recommend contacting the authors.

Hope this helps!