question about the dist function

Hello, I was going through your paper, and something didn't make sense with regard to the calculation of the D^p matrix. My confusion stems from the fact that the indices in the vector p are starting from 0 (in your example in figure 1, the word "eats", which is the parent of "monkey", "banana" and "eats" is defined as located in index 3, rather than 2, if we were to start the count from 0), whereas in the equation for calculating dist(p_t,j), j is counted from 0 rather than 1. Doesn't this create some sort of discrepancy? For example, in your example in figure 1, if I look at the first row, then for the five entries of this row we get in the exponent: (0,0): -((0-p_0)^2)/2 = -((0-2)^2)/2=-4/2=-2 (0,1): -((1-p_0)^2)/2 = -((1-2)^2)/2 = -1/2 (0,2): -((2-p_0)^2)/2 = -((2-2)^2)/2 = 0 (0,3): -((3-p_0)^2)/2 = -((3-2)^2)/2 = -1/2 (0,4): -((4-p_0)^2)/2 = -((4-2)^2)/2 = -2

So in fact we get that the highest value is in j=2 (corresponding to "eats"), even though the first word's parent is "monkey" (index 1). I would really appreciate if you could help me understand what I am missing. Thanks!

e-bug / pascal

question about the dist function #6