Closed (BBC-Esq closed 4 months ago)
Another option is to use ctranslate2 directly to create the embeddings; if I switch to faiss, they can be fed straight into the vector database. I would simply need to encode the query and make sure the faiss index receives it correctly, along with any necessary parameters, in order to run a search. Basically, I'm looking for ways to speed up database creation and/or search.
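To make the idea concrete, here is a rough sketch of the faiss side only. It is not the script from this thread. The embeddings are random placeholders standing in for whatever the ctranslate2 encoder would produce, the 384 dimension is an arbitrary assumption, and `build_index`, `search`, `normalize`, and `NumpyFlatIP` are hypothetical helper names. It uses `faiss.IndexFlatIP` (exact inner-product search) on L2-normalized vectors, which amounts to cosine similarity, and falls back to a brute-force NumPy stand-in if faiss is not installed.

```python
import numpy as np

try:
    import faiss  # optional dependency, e.g. the faiss-cpu package
except ImportError:
    faiss = None


class NumpyFlatIP:
    """Brute-force stand-in mirroring the add/search calls of faiss.IndexFlatIP."""

    def __init__(self, dim):
        self.vectors = np.empty((0, dim), dtype=np.float32)

    def add(self, x):
        self.vectors = np.vstack([self.vectors, x])

    def search(self, q, k):
        scores = q @ self.vectors.T                # inner product with every stored vector
        ids = np.argsort(-scores, axis=1)[:, :k]   # top-k by descending score
        return np.take_along_axis(scores, ids, axis=1), ids


def normalize(x):
    # faiss expects contiguous float32; unit-length rows make inner product == cosine
    x = np.ascontiguousarray(x, dtype=np.float32)
    return x / np.linalg.norm(x, axis=1, keepdims=True)


def build_index(embeddings):
    vecs = normalize(embeddings)
    dim = vecs.shape[1]
    index = faiss.IndexFlatIP(dim) if faiss else NumpyFlatIP(dim)
    index.add(vecs)
    return index


def search(index, query_embedding, k=3):
    # The query must be embedded by the same model and normalized the same way
    q = normalize(np.atleast_2d(query_embedding))
    scores, ids = index.search(q, k)
    return scores[0], ids[0]


# Placeholder data: 100 random "document" embeddings of an assumed size 384
rng = np.random.default_rng(0)
doc_embeddings = rng.standard_normal((100, 384))

index = build_index(doc_embeddings)
scores, ids = search(index, doc_embeddings[42], k=3)
print(ids, scores)
```

The flat index is exact and needs no training, so it is a reasonable baseline before trying approximate index types; whether it is fast enough depends on the number of chunks.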
base script:
Faster embeddings via Infinity, from a well-known ctranslate2 expert:
https://github.com/michaelfeil/infinity