-
Assigned to Abhinav and Karthik
-
### Feature request
Implement a new pipeline that takes both an image and text as inputs and produces a text output. This would be particularly useful for multi-modal tasks …
-
### System Info
transformers==4.45.2
### Who can help?
_No response_
### Information
- [x] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] An officiall…
-
Adding live text to the editor as you speak.
-
Right now it uses Twilio's built-in speech-to-text functionality. I want you to integrate Whisper as the STT model via Groq.
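A minimal sketch of what the Groq side of this could look like, assuming the official `groq` Python SDK and its `audio.transcriptions.create` call. The helper names `transcribe_with_groq` and `make_client`, the default file name, and the choice of the `whisper-large-v3` model are illustrative assumptions, not part of this repo. How the audio gets out of Twilio and into `audio_bytes` is left out on purpose.

```python
def transcribe_with_groq(client, audio_bytes, filename="audio.wav",
                         model="whisper-large-v3"):
    """Send raw audio bytes to Groq's transcription endpoint and return the text.

    `client` is expected to behave like `groq.Groq()`; it is passed in so the
    function can be exercised with a stub in tests.
    """
    result = client.audio.transcriptions.create(
        file=(filename, audio_bytes),  # (name, bytes) tuple, as in the SDK examples
        model=model,                   # assumed model id; adjust to what your account offers
    )
    return result.text


def make_client():
    """Build a real Groq client (hypothetical helper).

    Imported lazily so this module loads even when the SDK is not installed.
    """
    from groq import Groq
    return Groq()  # reads GROQ_API_KEY from the environment
```

Usage would be roughly `text = transcribe_with_groq(make_client(), audio_bytes)`, called wherever the Twilio transcript is consumed today.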
-
I will be changing this text to more relevant text.
Please assign this issue to me.
-
_blocked by #1358_
_possibly blocked by https://github.com/sul-dlss/speech-to-text/issues/21_
_question_: is this a separate operation of its own, or is this part of https://github.com/sul-dlss/co…
-
Hi there,
When I use MinHash with LSH, or SimHash, it's hard to remove short texts. Could anybody suggest a useful method to solve this problem? Thanks a ton!
Take the example below, and dive…
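One possible workaround, sketched under the assumption that the difficulty comes from short texts producing too few shingles for a stable Jaccard estimate: route short texts to exact-match deduplication and keep MinHash for the longer ones. Everything below is plain Python for illustration only. The function names, the `min_words=5` threshold, and the hand-rolled signature are assumptions, not the API of any particular MinHash library.

```python
import hashlib


def shingles(text, n=3):
    """Return the set of word n-grams; a text shorter than n words yields one shingle."""
    words = text.lower().split()
    if len(words) < n:
        return {" ".join(words)} if words else set()
    return {" ".join(words[i:i + n]) for i in range(len(words) - n + 1)}


def minhash_signature(items, num_perm=64):
    """Keep the minimum of each seeded hash over the items (illustrative MinHash)."""
    if not items:
        raise ValueError("cannot build a signature from an empty set")
    sig = []
    for seed in range(num_perm):
        salt = seed.to_bytes(8, "little")
        sig.append(min(
            int.from_bytes(
                hashlib.blake2b(s.encode(), digest_size=8, salt=salt).digest(),
                "little")
            for s in items))
    return sig


def estimated_jaccard(sig_a, sig_b):
    """Fraction of signature positions that agree."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def split_by_length(texts, min_words=5):
    """Separate texts too short for reliable shingling from the rest."""
    short, long_ = [], []
    for t in texts:
        (long_ if len(t.split()) >= min_words else short).append(t)
    return short, long_


def dedup_exact(texts):
    """Drop repeats after lowercasing and collapsing whitespace; keep first occurrence."""
    seen, kept = set(), []
    for t in texts:
        key = " ".join(t.lower().split())
        if key not in seen:
            seen.add(key)
            kept.append(t)
    return kept
```

The threshold is a tuning knob: with 3-word shingles, a 4-word text has only two shingles, so a single changed word can swing its similarity from 1.0 to 0.0.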
-
### Describe the bug
[/usr/local/lib/python3.10/dist-packages/gradio/external.py](https://localhost:8080/#) in from_model(model_name, hf_token, alias, **kwargs)
368 fn = client.image_to_…
-
From the paper, I see that you first embed text with text-embedding-3-large and then apply the projection network you trained with contrastive learning.
Can you also release the pretrained text proj…