Cosdata: A cutting-edge AI data platform for next-gen search pipelines. Features semantic search, knowledge graphs, hybrid capabilities, real-time scalability, and ML integration. Designed for immutability and version control to enhance AI projects.
The system shall build a HNSW index on the vectored information, based on the user specified similarity metric, in an automated manner in order to provide accurate results instantly.
Acceptance criteria
Context
The primary reason to store vectored information is to perform semantic search on unstructured data sets. The semantic search has to be perform accurately (with a reasonably high recall) and instantly (u second to milli-second latency). The HNSW index has proved itself to be both effective and efficient in real-world use.
The system will need to be able to automatically build the HNSW index on the vectored information. This should be done in parallel to the data ingestion. Users shall be able to re-index the vectored information as and when required.
Links
Title
Link
Efficient and robust approximate nearest neighbor search using Hierarchical Navigable Small World graphs
Description
The system shall build a HNSW index on the vectored information, based on the user specified similarity metric, in an automated manner in order to provide accurate results instantly.
Acceptance criteria
Context
The primary reason to store vectored information is to perform semantic search on unstructured data sets. The semantic search has to be perform accurately (with a reasonably high recall) and instantly (u second to milli-second latency). The HNSW index has proved itself to be both effective and efficient in real-world use. The system will need to be able to automatically build the HNSW index on the vectored information. This should be done in parallel to the data ingestion. Users shall be able to re-index the vectored information as and when required.
Links
Pending work