calculate the distance between user's ratings to cluster's centers

anyscale / academy

Ray tutorials from Anyscale

Apache License 2.0

586 stars 195 forks source link

Thanks for pointing out that code @YeziPeter , it needs more comments describing what happens at that point. Had to dig through to find an answer for that one :)

In the JokeRec.load_data() method where the data gets loaded, these ratings are scaled in advance. The raw data ranges [-10, 10] but the scaled data ranges [-1.0, 1.0]

Then these scaled rating values get used as the sample data for the clustering.

The c[item] value is the cluster center for ratings of a particular item (an individual joke), not a number of users who've rated an item. This is scaled the same way as the rating values.

Does that help?

anyscale / academy

calculate the distance between user's ratings to cluster's centers #35