-
https://github.com/se-asr/nst
-
Hello, it would be great to add read-aloud functionality, especially using Google cloud TTS that is great. Here's a great starting point https://github.com/ken107/read-aloud.
-
-
This thing needs to respond back to us on some event.
Right now, the strategy to reduce latency is to generate precanned responses constantly. Maybe we can also follow the same strategy with some TTS…
-
A user should be able to have their summarization spoken back to them.
A user when launching the script should be asked if they would like to have their summarization read back to them only if they…
-
### Describe the bug
ERROR: Failed building wheel for TTS when building/installing TTS
### To Reproduce
Clone repo and pip install -e .
### Expected behavior
_No response_
### Logs…
-
-
### Describe the bug
I followed the demo code provided by Coqui to create a simple dataset and fine-tune a model using Gradio. However, when I load the model and perform inference, the output audio i…
-
**Is your feature request related to a problem? Please describe.**
If you would like to use the wasm client, and want to instantiate STT, it involves downloading or loading multiple 100+ mb files. If…
-
the use of `webdataset` in training was added in release `v1.2.0`, but we don't have docs for it's usage yet in `stt.readthedocs.io`