-
Can this program run without noise? Because when the noise is clean, it always has an error.
running python feature_generation.py
"noise name: clean
save_path: /home/dsp/Desktop/fish/kws/SCR-Cap…
-
i cannot find the pretrained models is this link http://tts.speech.cs.cmu.edu/document_grounded_generation/cmu_dog/cmu_dog.zip
-
hello i am attempting to create text to speech with bark on intel a770 but it takes around 60 seconds to generate audio is that normal ? is there a way to make it faster like few seconds ? https://git…
-
Recently I played with @google's speech API and it seems they have a pretty accurate speech-to-text feature. I tested by extracting the audio of some @nytimes videos using
```
ffmpeg -i source -c:a …
-
To make this security enclave accessible to blind users, it should have a text-to-speech output option. Current-generation natural-sounding speech synthesizers are certainly too demanding for the proc…
-
In my speech synthesis system built from Merlin toolkit, it take long time to generate speech from text. Most of the time used by World Vocoder and DNN generation module. So, to improve time delay, I …
-
### Feature request
Please consider implementing Meta's open source Massively Multilingal Speech (MMS) with speech recognition and generation support for over 1000 languages with a drastically reduce…
-
Would there be a way to generate long speeches ?
Because right now, it requires to be fed with at least 3 seconds of speech each time you want to inference something new. And if the length of the …
-
### Describe the bug
only needs a lot time
### Is there an existing issue for this?
- [X] I have searched the existing issues
### Reproduction
every model with whisper small medium ....
### S…
-
```
~ via 🐍 v3.11.9 (insanely-fast-whisper)
❯ pipx list
venvs are in ~/.local/pipx/venvs
apps are exposed on your $PATH at ~/.local/bin
manual pages are exposed at ~/.local/share/man
package …