This PR adds a HNSW index as the coarse quantiser for IVF indexes - avoid brute force search for the centroids closest to the query vector and instead use a HNSW centroid index .
Verified this to not impact recall significantly in E2E testing -
10M dataset - 768 dims
Case | Recall | Accuracy
without change | 0.85795 | 0.897
with change | 0.8565999999999999 | 0.895
5M dataset - 1536 dims
Case | Recall | Accuracy
without change | 0.9591 | 0.979
with change | 0.95811 | 0.981
This PR adds a HNSW index as the coarse quantiser for IVF indexes - avoid brute force search for the centroids closest to the query vector and instead use a HNSW centroid index .
Verified this to not impact recall significantly in E2E testing -
10M dataset - 768 dims Case | Recall | Accuracy without change | 0.85795 | 0.897 with change | 0.8565999999999999 | 0.895
5M dataset - 1536 dims Case | Recall | Accuracy without change | 0.9591 | 0.979 with change | 0.95811 | 0.981