sa-lee / starmie

starmie: plotting and inference for population structure models :star2:
Other
12 stars 6 forks source link

Estimate the best K #4

Closed gtonkinhill closed 8 years ago

gtonkinhill commented 8 years ago

A function that runs the method of Evanno and plots ln(Pr(X|K) values in order to identify the k for which Pr(K=k) is highest (as described in STRUCTURE's manual, section 5.1)

sa-lee commented 8 years ago

I also think an interesting possibility would be to pool results from various tools here. Not exactly sure how to go about this at this stage.

gtonkinhill commented 8 years ago

clumpak does something like that, although I haven't looked at it yet

sa-lee commented 8 years ago

for admixture is this the change point in the CV graph?

gtonkinhill commented 8 years ago

I think we can close this now, unless we want to put something in for Admixture?

sa-lee commented 8 years ago

looks good. i'll add something in for admixture before closing that uses admixtures cross-validation method to select the min.

sa-lee commented 8 years ago

I've added in admixture CV graph. Also throws error if the user hasn't added a log file.

With the repeated runs for STRUCTURE would it be possible to do something akin to https://en.wikipedia.org/wiki/Wilcoxon_signed-rank_test for model selection?

Also I don't know how mathematically sound this would be... but if you're doing repeated structure runs kinda akin to doing a 'bootstrap' like procedure ---> could do likelihood ratio tests?