Open ppaanngggg opened 3 weeks ago
+1 also consider supporting any multimodal embedding model. This is the biggest blocker to us adopting
We are currently figuring a list of integrations and the priority in which we are going to tackle them. We'll keep this one in mind during our planning.
Thanks for submitting the issue.
What problem does the new feature solve?
jina-clip-v1 is the best multi-modal embedding model now.
What does the feature do?
It can be used to build better image retrieval application.
Implementation challenges
According to the api https://jina.ai/?sui&model=jina-clip-v1
We need to pass plain text as
or image from url or base64 encoded as
Are you going to work on this feature?
🆘 No, could someone else please consider working on it?