Closed awinml closed 10 months ago
Hello, @awinml! And sorry for the late reply...
It would be a nice integration to have.
If you would like to contribute to this integration, I would propose to:
Feel free to let us know what you think. Thanks!
That sounds good! These models are new and the API access is not fully open yet, so it might take a while for them to become popular.
I will create my own repository for this integration in the meantime and create a PR in haystack-integrations once its ready.
Thanks! @anakin87
@anakin87 I have created a repository (voyage-embedders-haystack) for the integration and also opened a PR (https://github.com/deepset-ai/haystack-integrations/pull/85) in haystack-integrations. Thanks for your help!
Great! We will review your PR in haystack-integrations...
Add VoyageAI Embedding Models support to Haystack (v2.0x)
bge
and OpenAI -text-embedding-ada-002
.voyage-01
andvoyage-lite-01
. The performance comparison shown by them usingvoyage-01
can be seen below.Embedding models released:
Currently supports two models:
voyage-01
Context Length (tokens): 4096 Embedding Dimension: 1024voyage-lite-01
Context Length (tokens): 4096 Embedding Dimension: 1024
VoyageAI also plans to offer embedding models tailored for coding and finance, with more domains on the horizon:
voyage-xl-01
voyage-code-01
voyage-finance-01
More information about their embedding models can be found on their Embeddings documentation.
Simple usage example using the Python SDK:
A complete working example to perform Semantic Search uses these embedding models can be found in this Colab Notebook.
Python SDK API Documentation: https://docs.voyageai.com/embeddings/#via-voyage-python-library
Describe the solution you'd like
Integrate VoyageAI Embeddings with Haystack (v2.0x), specifically a
VoyageTextEmbedder
andVoyageDocumentEmbedder
using the VoyageAI python SDK.