womogenes / semantle-bot

8 stars 2 forks

enhancement suggestion #1

Open dandanua opened 2 years ago

dandanua commented 2 years ago

This brute-force solution will not work if Semantle's reported distance has very low accuracy.

A smarter approach is to search for words that represent different clusters of words. You then split each cluster into subclusters, each with its own representative word, and repeat recursively.

It's much harder to implement, of course. The question of how to pick the clusters is itself very hard.
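To make the cluster idea concrete, here is a minimal sketch in Python. It is not the bot's actual code, and everything in it is an assumption made for illustration: the toy `VECTORS`, the hypothetical `score` oracle (standing in for the similarity Semantle reports), and the choice of farthest-point sampling to pick representatives, which is only one of many possible answers to the "how do we pick clusters" question. The descent is greedy, so it can miss the target when the best-scoring representative is not in the target's cluster, and that risk grows as the reported score gets noisier.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

def pick_representatives(words, vectors, k):
    """Farthest-point sampling: greedily pick up to k mutually dissimilar words."""
    reps = [words[0]]
    while len(reps) < min(k, len(words)):
        # Take the word whose closest existing representative is least similar.
        nxt = min(
            (w for w in words if w not in reps),
            key=lambda w: max(cosine(vectors[w], vectors[r]) for r in reps),
        )
        reps.append(nxt)
    return reps

def cluster_search(words, vectors, score, k=3, leaf_size=3):
    """Guess one representative per cluster, descend into the best cluster, repeat."""
    asked = {}  # cache so each word costs at most one guess

    def ask(word):
        if word not in asked:
            asked[word] = score(word)
        return asked[word]

    candidates = list(words)
    while len(candidates) > leaf_size:
        reps = pick_representatives(candidates, vectors, k)
        clusters = {r: [] for r in reps}
        for w in candidates:
            nearest = max(reps, key=lambda r: cosine(vectors[w], vectors[r]))
            clusters[nearest].append(w)
        best_rep = max(reps, key=ask)
        if len(clusters[best_rep]) == len(candidates):
            break  # the split made no progress; fall back to brute force
        candidates = clusters[best_rep]
    # Brute-force the small remaining cluster.
    best = max(candidates, key=ask)
    return best, len(asked)

# Toy embedding (assumption): three well-separated groups of three words.
VECTORS = {
    "cat": (1.0, 0.1, 0.0), "dog": (1.0, 0.2, 0.0), "lion": (1.0, 0.0, 0.1),
    "car": (0.1, 1.0, 0.0), "bus": (0.0, 1.0, 0.1), "train": (0.2, 1.0, 0.0),
    "apple": (0.0, 0.1, 1.0), "pear": (0.1, 0.0, 1.0), "plum": (0.0, 0.2, 1.0),
}
SECRET = "train"

def score(word):
    """Hypothetical stand-in for the similarity the game reports."""
    return cosine(VECTORS[word], VECTORS[SECRET])

found, queries = cluster_search(list(VECTORS), VECTORS, score)
print(found, queries)
```

On this toy data the search finds the secret word with fewer guesses than trying all nine words. With a real vocabulary, a more robust variant might keep the top few clusters at each level instead of only the best one.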

womogenes commented 2 years ago

This is a great idea to make the algorithm faster/more robust! It would be very interesting to investigate further.