-
in sts tab @ any voice i uploaded selected , output is alway same one (cn-nan) while voice from the code(such as cn-nan.wav, cn-XiaoyiNeural) selected, output is the voice selected.
-
Recently I played with @google's speech API and it seems they have a pretty accurate speech-to-text feature. I tested by extracting the audio of some @nytimes videos using
```
ffmpeg -i source -c:a …
-
### Title
A quantum-inspired sentiment representation model
### Team Name
Noah
### Email
202311016@daiict.ac.in
### Team Member 1 Name
Harsh Vyas
### Team Member 1 Id
20231…
-
### Feature request
Please consider implementing Meta's open source Massively Multilingal Speech (MMS) with speech recognition and generation support for over 1000 languages with a drastically reduce…
-
https://charactr-platform.github.io/vocos/
It looks like it might improve the audio quality for speech generations.
-
Hi! !مرحبا! السلام عليكم
Let's bring the documentation to all the Arabic-speaking community 🌏 (currently 0 out of 267 complete)
Would you want to translate? Please follow the 🤗 [TRANSLATING guid…
-
# Speech Separation
Speech separation is the task of obtaining clean, single-speaker speech from a speech mixture of multiple overlapping speakers.
## Task Objective
**Why is this task needed…
-
To make this security enclave accessible to blind users, it should have a text-to-speech output option. Current-generation natural-sounding speech synthesizers are certainly too demanding for the proc…
-
Hello @lucidrains
I would like to test the pre-trained models for speech generation
How would I be able to do that.
-
Hi, thank you for your interesting work. I was trying to reproduce the experiment results using the code provided and had some questions.
1. I have attempted to run the `train_and_eval.sh` script t…