-
## Dataset Format
The pre-processing script expects data to be a directory with:
* `metadata.csv` - CSV file with text, audio filenames, and speaker names
* `wav/` - directory with audio files
The …
-
GitHub:
https://github.com/open-mmlab/Amphion/blob/main/models/tts/maskgct/README.md
Demo Page:
https://maskgct.github.io/
This is probably the current SOTA model, much better than F5-TTS. They …
-
The open-source version does not seem to be using TTS-adapter.
Why did you make this change from mini-omni-1?
-
The Koboldcpp app is amazing. The only issue I see is the TTS occurs after the text is finished which takes forever. Is there a way to have the TTS occur as the text is being outputted to reduce the d…
-
I just saw this TTS model(https://github.com/SWivid/F5-TTS), which works very well for English. Are you planning on including it on the project? Thanks!!
-
The SYN6988 TTS board uses a UART interface to send / receive data. We need to write a neat wrapper for all this functionality so that it will be easier to create a standardized data -> sound output p…
-
### 📦 Deployment Method
Vercel
### 📌 Version
v2.15.7
### 💻 Operating System
Windows
### 📌 System Version
Win10
### 🌐 Browser
Edge
### 📌 Browser Version
130
### 🐛 Bug Description
某些情况下,一些文…
-
Attempt 1 failed, retrying...
TTS request failed, status code: 404
TTS request failed, switching back to mode 2 and retrying
TTS request failed, status code: 404
❌ Error: Failed to get audio durat…
-
### Self Checks
- [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones.
- [X] I confirm that I am using English to su…
-
Hi!
I’m really excited about this project! I have a similar one that uses JavaScript.
I would like to include an option for Speech-to-Text (STT). I’ve found that the Facebook model provides bett…