-
_question_: is this a separate operation of its own, or is this part of https://github.com/sul-dlss/common-accessioning/issues/1358? marking this blocked till we have enough progress on that ticket t…
-
Thank you for your open-source work. I would like to ask you some questions. I tried to use diffloss for a speech generation task, adopting the next token prediction approach. This corresponds to the …
-
Connection timeout to host wss://speech.platform.bing.com/consumer/speech/synthesize/readaloud/edge/v1?TrustedClientToken=6A5AA1D4EAFF4E9FB37E23D68491D6F4&ConnectionId= ....
when i run "edge-tts/ex…
-
**Project description**
Nexa SDK is a comprehensive toolkit for supporting ONNX and GGML models. It supports text generation, image generation, vision-language models (VLM), auto-speech-recognition…
-
after some discussion and some prototyping in https://github.com/sul-dlss/speech-to-text/pull/9 and https://github.com/sul-dlss/common-accessioning/pull/1367, it seems like we've settled on SQS for co…
-
### Is there an existing issue for the same feature request?
- [X] I have checked the existing issues.
### Is your feature request related to a problem?
```Markdown
For the text-to-speech feature, …
-
_Keep in mind the guidelines at the top of https://github.com/sul-dlss-labs/sul_media_transcription_services/issues/1. In particular, the ones about ultimately using Terraform for deployment, and abo…
-
I am interested in “How to use mamba to generate audio”. One of amazing things is the long sequence attention, i wanna know whether mamba can be used in TTS, so that it does not need the Vocoder. Mayb…
-
Hello together,
I am currently trying to use OpenVoice for German language generation. I have not been able to figure out how this zero shot speech synthesis shall work. Is there some kind of multila…
-
### **Description**
We are going to fine-tune Meta's **MMS (Massively Multilingual Speech)** model for **Tibetan text-to-speech (TTS)** using our Tibetan dataset. The pipeline will cover data preproc…