I have gone through both the UPGMA wiki and the Figure 2 of Dave Thomas. I feel that when updating the distance matrix, they are somewhat contradictory. For instance, in Figure 2 of Dave Thomas, when (B,F) and G are combined, dist(((B,F),G),E) should be (dist((B,F),E)*2 + dist(G,E)*1)/(2+1) = (dist(B,E)+dist(F,E)+dist(G,E))/3, if according to the UPGMA wiki. That would give value of 33 instead of 31.8=(35.5+28)/2. You might would like to check how your codes address this issue. Personally I feel the scheme provided by the UPGMA wiki is more appropriate according to its name unweighted, whereas the example in Dave Thomas is likely to be WPGMA.
Hi there,
I have gone through both the UPGMA wiki and the Figure 2 of Dave Thomas. I feel that when updating the distance matrix, they are somewhat contradictory. For instance, in Figure 2 of Dave Thomas, when (B,F) and G are combined,
dist(((B,F),G),E)
should be(dist((B,F),E)*2 + dist(G,E)*1)/(2+1) = (dist(B,E)+dist(F,E)+dist(G,E))/3
, if according to the UPGMA wiki. That would give value of 33 instead of 31.8=(35.5+28)/2. You might would like to check how your codes address this issue. Personally I feel the scheme provided by the UPGMA wiki is more appropriate according to its name unweighted, whereas the example in Dave Thomas is likely to be WPGMA.