Integrating boostrapping files from ADMIXTURE

ramachandran-lab / pong

Fast analysis and visualization of latent clusters in population genetic data

66 stars 11 forks source link

Hi Neha,

Great question! To generate multiple Q-files per K we actually run ADMIXTURE multiple times for each value of K. This is not strictly necessary -- you should still be able to use pong to visualize and analyze your existing data.

The reason it can be useful to generate replicate runs per K is to assess the robustness of the clusters inferred at each value of K. The stochasticity of ADMIXTURE's clustering approach means that replicate runs can produce distinct solutions even when the same initial conditions are used. These distinct solutions can result from real biological factors, and we refer to this concept as multimodality (Jakobsson and Rosenberg, 2007; Behr et al., 2016).

Hope this helps. Feel free to reach out if you have any other questions!

ramachandran-lab / pong

Integrating boostrapping files from ADMIXTURE #4