Open timmzak opened 2 years ago
Hi, you could drop the Jaccard distance and cluster with k-means instead. Please refer to https://github.com/yxgeee/MMT/blob/057e1ea5d3054c9d7e5fa72c727298d8e4c5f668/examples/mmt_train_kmeans.py
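For illustration, here is a minimal sketch of the k-means idea, not the code from the linked script. It clusters feature vectors directly, so no pairwise distance matrix is built. The function name `kmeans_pseudo_labels` and its parameters are hypothetical, and `features` is assumed to be an (N, D) NumPy array of nonzero embeddings.

```python
import numpy as np


def kmeans_pseudo_labels(features, num_clusters, num_iters=20, seed=0):
    """Plain Lloyd's k-means. Returns (labels, centers).

    Hypothetical helper for illustration only. Memory use is O(N * K)
    for the sample-to-center distances, not O(N * N).
    """
    rng = np.random.default_rng(seed)
    # L2-normalize so Euclidean distance tracks cosine similarity.
    # Assumes no feature vector is all zeros.
    feats = features / np.linalg.norm(features, axis=1, keepdims=True)

    # Initialize centers from randomly chosen distinct samples.
    init_idx = rng.choice(len(feats), size=num_clusters, replace=False)
    centers = feats[init_idx].copy()

    def assign(current_centers):
        # Squared Euclidean distance from every sample to every center: (N, K).
        dists = (
            (feats ** 2).sum(axis=1, keepdims=True)
            - 2.0 * feats @ current_centers.T
            + (current_centers ** 2).sum(axis=1)
        )
        return dists.argmin(axis=1)

    for _ in range(num_iters):
        labels = assign(centers)
        for k in range(num_clusters):
            members = feats[labels == k]
            if len(members) > 0:  # keep the old center if a cluster is empty
                centers[k] = members.mean(axis=0)

    # Final assignment against the last updated centers.
    return assign(centers), centers


if __name__ == "__main__":
    demo = np.random.default_rng(1).normal(size=(1000, 64)).astype(np.float32)
    labels, centers = kmeans_pseudo_labels(demo, num_clusters=10)
    print(labels.shape, centers.shape)
```

The returned labels can serve as pseudo labels for training. The trade-off is that the number of clusters has to be chosen in advance.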
OK, thank you. I will try this solution in the future.
Hello!
In your solution, the matrix of Jaccard distances is precomputed and takes up a lot of RAM. Is there a way to reduce this memory usage? It matters for large datasets.
Thanks!
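To put rough numbers on the memory concern, here is a back-of-the-envelope sketch. It assumes a dense N x N matrix of 4-byte floats and compares it with the N x K sample-to-center distances that a k-means approach would need. The function names and the cluster count of 500 are hypothetical, chosen only for illustration.

```python
def pairwise_matrix_bytes(num_samples, bytes_per_value=4):
    """Bytes needed for a dense N x N distance matrix (float32 assumed)."""
    return num_samples * num_samples * bytes_per_value


def kmeans_assignment_bytes(num_samples, num_clusters, bytes_per_value=4):
    """Bytes needed for N x K sample-to-center distances (float32 assumed)."""
    return num_samples * num_clusters * bytes_per_value


if __name__ == "__main__":
    num_clusters = 500  # hypothetical cluster count
    for n in (10_000, 100_000, 1_000_000):
        full_gib = pairwise_matrix_bytes(n) / 2**30
        kmeans_gib = kmeans_assignment_bytes(n, num_clusters) / 2**30
        print(f"N={n:>9,}: pairwise {full_gib:10.2f} GiB, "
              f"k-means {kmeans_gib:6.2f} GiB")
```

The pairwise matrix grows quadratically with the dataset size, while the k-means distances grow linearly for a fixed number of clusters.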