Question: Single vs. Multiple HNSW Graphs for Nearest Neighbor Searches Across Multiple Companies?

nmslib / hnswlib

Header-only C++/python library for fast approximate nearest neighbors

Apache License 2.0

4.38k stars 647 forks source link

Hello,

I'm working on a project that involves using HNSW (HNSWlib) for nearest neighbor searches among data from about 100 different companies (Company A, B, C, D, etc.). Since the data for each company is "unique", I'm trying to figure out the best way to organize this in HNSW graphs to get good search results fast.

I want to know

if it's better to put all the 100 companies' data into one big HNSW graph (would it give a good result?)
if I should create 100 separate graphs for each company to keep the search efficient and accurate.
Is there a better way maybe?

Thank you!

nmslib / hnswlib

Question: Single vs. Multiple HNSW Graphs for Nearest Neighbor Searches Across Multiple Companies? #545