Open bpbentley opened 3 years ago
Hi Blair,
Yeah, if you have 12 populations then I would try and tweak the -e parameter manually. If possible then you can set it to the e=K-1, where K is the number of expected distinct populations. Alpha shouldn't due too much expect smoothing admixture proportions.
Let me know if you have any further questions. :-)
Best, Jonas
Hi Jonas,
Thanks very much for the swift response! That makes sense, I'll run through the clusters manually. I appreciate the help!
Best, Blair
Hi, I'm trying to run Admixture models on 93 samples from 12 populations with low coverage WGS data (~2.5X). I keep obtaining large log likelihoods with my models (see error output below), regardless of the number of clusters I specify. I'm wondering if this issue is related to the alpha parameter and whether you have any recommendations for overcoming the issue. I was also wondering whether PCAngsd chooses the best cluster (in this case 2), or whether I should manually change the -e parameter to find the best model. Thanks in advance!
CMD:
pcangsd.py -beagle WGR_genolike.beagle.gz -o PCAngsd/PCAngsd_Admix_1 -admix -threads 12
OUTPUT:
Best, Blair