wongjiahau / data-mining-past-year-ans

1 stars 0 forks source link

DM2017PY: opinion in Q5 (c) #2

Open kamito98 opened 6 years ago

kamito98 commented 6 years ago

I am Chin Chee Hoong from data mining course January 2018.

For the Q5 c (ii), actually the f2() is euclidean distance, not minkowski distance (minkowski distance can become Manhattan distance if r = 1)

Then, about the disadvantage of Manhattan distance over the (Euclidean distance) is, we are having a hard time to determine the "actual" distance between the points.

For example, lets use P1 (4,5) and P2(1,8) and P3(0,0). By calculating the euclidean distance, the euclidean distance between P1 and P3, and between P2 and P3, is sqrt(41) and sqrt(65). However, by calculating their Manhattan distance, both results are the same, which is 9!

Manhattan distance is pretty like calculating how many move you need to do so that you can reach the end point from the start point. For example, if you want to reach P2 (5,0) from P1 (0,0), you just need to go left for 5 times, with each time increment x-axis value by 1. That value then can think as the difference (Manhattan distance) of both points.

wongjiahau commented 6 years ago

@kamito98 Thanks for pointing out!