Open ismajl-ramadani opened 4 months ago
I too noticed this while working on something else.
Personally, I'd like to see the ids added as an Option<Vec<String>>
or similar to add_documents
, but this will break backward compatibility.
@Abraxas-365 @prabirshrestha, any thoughts on this?
I'm good with the breaking change. Feel free to send a PR.
I want to add custom IDs for documents so that when I do re-indexing for updates, I can reference them with the ID I have in the data source.
My specific case is with the OpenSearch vector store. Right now as a workaround, I'm running a local build of this crate, and I have added a function with the following signature:
and then I'm zipping the ids together with the docs and vectors
and finally adding the id to the docs
I wanted to ask if someone would be interested in having this feature as well, and if yes, any suggestions from maintainers on how to implement this without having to break the core trait of VectorStore somehow.
Also, Python package has one additional field for the add_docs, called ids and you can see it in this file: https://github.com/langchain-ai/langchain/blob/29aa9d67506ac07b92d37d58c684ce3c6dc290cd/libs/community/langchain_community/vectorstores/opensearch_vector_search.py#L587