Open JoyeBright opened 3 years ago
Hey @JoyeBright
Now for similarity search you try the following options: If you want to implement from scratch you can use the following:
Production ready pipelines you can use:
Have a look at ANN here: https://www.sbert.net/examples/applications/semantic-search/README.html#approximate-nearest-neighbor
Greetings,
Just wanted to know what technique/ implementation you suggest for finding similar sentences over a very large corpus? 31M sentences, for instance!
Worth mentioning that my question is more concerned with memory management rather than speed.
Cheers, Javad