Question--time complexity for high LID dataset

nmslib / hnswlib

Header-only C++/python library for fast approximate nearest neighbors

Apache License 2.0

4.27k stars 629 forks source link

Hi @jianshu93,

This is discussed in HNSW paper. The O(N*log(N)) is asymptotic scaling under (IMO) reasonable assumptions. That means that to observe this scaling on high LID datasets one might have to increase N to astronomical values to observe the expected scaling, similar to other algorithms with logarithmic scalability (e.g. kd-tree).
Yeah. HNSW has an approximation of relative neighborhood graph which is very similar to the kNN graph (and better for routing), so extracting nearest neighbor should be a cheap operation there.

nmslib / hnswlib