manticoresoftware / manticoresearch

Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon
https://manticoresearch.com
GNU General Public License v3.0
8.97k stars 498 forks source link

Support for Namespaces in Vector Search #2321

Open anupriya17 opened 4 months ago

anupriya17 commented 4 months ago

Proposal:

Thank you for your amazing product in the open-source world with enterprise features. Does ManticoreSearch supports namespaces in vector search similar to how Cloudflare VectorDB or Qdrant VectorDB support them.

Namespaces provide a way to segment the vectors within your index. For example, by customer, merchant, or store ID. When a namespace is provided, only vectors within that namespace are used for the search. Namespace filtering is applied before vector search, not after.

This feature would be highly beneficial for organizing and efficiently querying vectors in various segments in single collection or table

https://developers.cloudflare.com/vectorize/best-practices/insert-vectors/#namespaces https://qdrant.tech/documentation/guides/multiple-partitions/

Checklist:

To be completed by the assignee. Check off tasks that have been completed or are not applicable.

- [ ] Implementation completed - [ ] Tests developed - [ ] Documentation updated - [ ] Documentation reviewed - [ ] Changelog updated - [x] OpenAPI YAML updated and issue created to rebuild clients
sanikolaev commented 4 months ago

Thanks for the feature request. Manticore doesn't support it now and you have to create a separate table for each tenant, but we'll discuss if we can add easier support for it.

anupriya17 commented 4 months ago

Thanks and looking forward to it.

sanikolaev commented 3 months ago

@anupriya17

This feature would be highly beneficial for organizing and efficiently querying vectors in various segments in single collection or table

Can you provide any benchmarks, studies or smth on when segmenting vectors into namespaces helps in terms of performance / accuracy?