nomic-embed-text-v1 is 8192 context length text encoder that surpasses OpenAI text-embedding-ada-002 and text-embedding-3-small performance on short and long context tasks.
Types of changes
[x] New feature (non-breaking change which adds functionality)
Checklist:
[x] My code follows the code style of this project.
[x] My change requires a change to the documentation.
This PR introduces nomic embeddings to Spark NLP
Description
nomic-embed-text-v1 is 8192 context length text encoder that surpasses OpenAI text-embedding-ada-002 and text-embedding-3-small performance on short and long context tasks.
Types of changes
Checklist: