Closed by naborlozada 2 years ago
It is generally better to have populations of similar sample size for PCA, but unless the differences are really large, I don't think this is a problem for PCA or pcadapt.
Hi,
So, I understand that my populations (from above) are okay to compare to each other as described.
Thank you for your reply!
I hadn't seen the numbers. They are quite low. I have never used pcadapt with so few samples, so I don't know. But I guess at least pop2 and pop7 are not okay.
Yes, that was my concern. I was thinking of removing the populations with 5 or fewer samples (or, to be stricter, keeping only those with at least 10 samples). It is an arbitrary filter, though. I'll run some tests. Thanks for your help. Best.
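A minimal sketch of the threshold filter described above, written in Python only for illustration. The sample sheet, population labels, and counts are hypothetical (the real per-population counts are not shown in this thread), and `filter_by_min_samples` is an illustrative helper, not part of pcadapt. It only selects which individuals to keep; the genotype file would still have to be subset accordingly before running pcadapt.

```python
from collections import defaultdict


def filter_by_min_samples(sample_sheet, min_samples):
    """Group sample IDs by population and keep populations with
    at least `min_samples` individuals.

    sample_sheet: iterable of (sample_id, population) pairs.
    Returns a dict mapping population -> list of sample IDs.
    """
    by_pop = defaultdict(list)
    for sample_id, pop in sample_sheet:
        by_pop[pop].append(sample_id)
    return {pop: ids for pop, ids in by_pop.items() if len(ids) >= min_samples}


# Hypothetical sample sheet: 12, 3, and 8 individuals per population.
hypothetical_counts = {"popA": 12, "popB": 3, "popC": 8}
samples = [
    (f"{pop}_{i}", pop)
    for pop, n in hypothetical_counts.items()
    for i in range(n)
]

# "More than 5 samples" corresponds to min_samples=6;
# the stricter filter corresponds to min_samples=10.
for threshold in (6, 10):
    kept = filter_by_min_samples(samples, threshold)
    print(threshold, {pop: len(ids) for pop, ids in kept.items()})
```

Running the analysis once per threshold and comparing the outlier lists is one way to check how sensitive the results are to this arbitrary cutoff.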
Hi all,
I'm analyzing 7 populations (3 from America and 4 from Africa) and trying to find outliers with pcadapt. Some populations are located in the same country, while others are very distant. Each has a different number of samples (individuals):
I am trying to find a signal of local adaptation (if any) at the country and continent levels. However, I was wondering whether populations with few samples (pop2 and pop4) might bias the analysis. Also, my populations do not all have the same number of samples, unlike in your pcadapt example. So, when running an analysis at the country level (for example, with the populations from country D) or the continent level (all populations from Africa or America), should I remove those with few samples?
How should I deal with populations that have different numbers of samples? Is there a test to evaluate this, or a way to normalize? Any suggestions would be really appreciated.
Best regards. Nabor