Open ecoates-bc opened 1 year ago
It's possible but what if what they got is better than vanilla K-means in terms of the recommendation outcome? I like the idea of choosing your own recommendation algorithm/outcome.
Adopting pure, vanilla K-means clusterning means elon musk's tweets are not prioritized, so I don't think they can do it.
Adopting pure, vanilla K-means clusterning means elon musk's tweets are not prioritized, so I don't think they can do it.
That's a good point! Maybe he could get his own little cluster
It's possible but what if what they got is better than vanilla K-means in terms of the recommendation outcome? I like the idea of choosing your own recommendation algorithm/outcome.
hmmm that's very true, you would not get away with an out-of-the-box, everyday K-means implementation and retain the same degree of being able to tune it on-the-fly per user. I'm sure there could be an implementation that preserved this, maybe client-side post-processing of clusters. But yea either way definitely would not want to sacrifice tuning in order to achieve an implementation.
A like is an input. A like + others who liked the same is another. The time the like happened is another. The difference of the time of the like for the same thing of various users matters. The time of a user spending on said thing that was liked matters. This is probably silly but it goes deeper. The main problem atm is Data. There are tons of possibilities to get Data even with the current architecture. Once Data value is satisfying, then Algorithmic process will be more exciting ( more difficult than it is ). Not even the current version of transformers will be able to give accurate outputs ( in relation with user's well being and market wanting to invest with advertisements).
I agree with your sentiments expressed here. But, K-means presents major drawbacks:
So I've had a look around this repo and it looks like there are a LOT of files in it.
That's not necessarily a bad thing! But maybe if you trimmed your recommender system down a little, it would help your servers' overhead, and that way save you folks a lot of money.
So, I think you should just use k-means clustering instead of whatever spaghetti you have in here currently.
Why K-means clustering?
There are a lot of reasons why you'd use k-means clustering in this case.
Conclusion
I think you should use K-means clustering for your project "The Algorithm." Let me know how it goes!