-
OpenAI GPT 4o model supports both in and out of text, image and audio. Understanding is finer than usual STT > model > TTS approach because the model has direct access to user behavior, emotions, etc.…
-
Multilingual Speech-to-Text from 🐸STT can be easily added to the project:) We're (@coqui-ai) happy to help!
https://github.com/coqui-ai/stt
-
### Objective
Add (optional) whisper STT to the backend deployments
### Initial Implementation Requirements
- Whisper STT container with gradio interface
### Other Considerations
- Consider repla…
-
Currently, I host two STT and TTS containers, one pair for Home Assistant, one for Open-WebUI.
If openedai-whisper added support for the Wyoming protocol, it could also be used directly in Home Assis…
-
Any plan to add other TTS/STT models?
-
您好,在运行代码时遇到这个问题
```
Namespace(epoch=50, batch_size=16, d_model=64, n_warmup_steps=1000, dropout=0.3, embs_share_weight=False, proj_share_weight=False, log=None, save_path='./checkpoint/DiffusionPred…
Ywinh updated
2 weeks ago
-
I'm having trouble capturing timing information with VAD + STT.
given:
```python
openai_stt = openai.STT()
vad = silero.VAD()
vad_stream = vad.stream()
stt = StreamAdapter(openai_stt, vad_stre…
-
Hi there,
I wanted to express my appreciation for the intriguing and innovative STT method you've developed. It's truly fascinating, and I've been excited to explore the example code you've provide…
-
I stream with friends often and would like to open the discord chat to STT for talking to the buddies. This likely will be unusable until #19 is implemented.
- Having the ability to capture individ…
-
Is there (a plan for) a way to use the OpenAI servers for STT/TTS? They are fairly slow, unfortunately, but they might be a good option for some people.