weaviate / weaviate

Weaviate is an open-source vector database that stores both objects and vectors, allowing for the combination of vector search with structured filtering with the fault tolerance and scalability of a cloud-native database​.
https://weaviate.io/developers/weaviate/
BSD 3-Clause "New" or "Revised" License
11.17k stars 773 forks source link

Feature Request: Add A Range-Specific Index to improve range filter queries with very high cardinality #3739

Open etiennedi opened 11 months ago

etiennedi commented 11 months ago

If the cardinality for a range query is very high (i.e. it matches many distinct values) the current approach is essentially a full-table scan on the inverted index. This runs very slow on large datasets. This ticket is a feature request to build support for native range queries

Bitmaps can be used to to build specific range-based indexes that can retrieve all matches in a very few lookups.

etiennedi commented 11 months ago

Some thoughts:

Based on this, I would suggest that: