-
Hello,
I have a question about SE models. I want to get the single-channel DCCRN model that has the best PESQ value for the CHiME-4 dataset, but I couldn't find it on the Hugging Face models page.
Thanks.
-
### Describe the feature/enhancement
Hi, first of all, thanks a lot for the great app. I am just wondering if it's possible to integrate TTS? I am using some apps with a TTS feature to listen to epub or aws ebook all …
-
Speech to Text at some point? 👍
http://giderosmobile.com/forum/discussion/7394/speech-to-text-at-some-point
> Since TextInputDialog displays a native keyboard with the speech-to-text option (the …
-
## 🚀 Feature
Add new audio metrics for generative audio processing
### Motivation
The evaluation of speech processing (denoising, dereverberation and in general enhancement) highly depends o…
-
I was thinking we could add Text to Speech to the translations.
I found a couple of libraries we could possibly use, if that is OK with the admins.
<https://github.com/mozilla/TTS>
[https://git…
-
* [ ] [pystoi](https://github.com/mpariente/pystoi)
* [ ] [speex](https://github.com/imankulov/speex-quality-evaluation) (seems not maintained anymore)
* [ ] [python-pesq](https://github.com/vBaiCai…
-
Hi,
Thanks for your great work. I have several questions and hope you can clarify them.
For the Storm model in the paper, what is the batch size (I saw 8 by default in the code)? How many epochs did…
-
### What happened?
I supplied it a two hour recording of a radio program.
For about 30 minutes of the recording it repeatedly output the line [Music] rather than transcribing the spoken words betw…
-
torch.cuda.OutOfMemoryError: CUDA out of memory. Tried to allocate **137.36 GiB.** GPU 0 has a total capacity of 47.54 GiB of which 44.17 GiB is free. Process 1932274 has 3.36 GiB memory in use. Of th…
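
For what it's worth, the numbers in that message already suggest that freeing cached memory would not be enough, because the single request is larger than the whole card. Below is a rough back-of-the-envelope sketch using only the figures quoted in the error. The `min_chunks` helper is hypothetical, and it assumes memory use grows linearly with input length, which may not hold for this model (attention layers, for example, often grow faster than that).

```python
import math

# Figures copied from the error message above (all in GiB).
requested_gib = 137.36
total_gib = 47.54
free_gib = 44.17


def min_chunks(requested: float, available: float) -> int:
    """Smallest number of equal pieces such that each piece fits in `available`.

    ASSUMPTION: memory use scales linearly with input length. If it scales
    faster (e.g. quadratically), fewer chunks may be enough; treat this only
    as a starting point.
    """
    return math.ceil(requested / available)


ratio = requested_gib / total_gib
chunks = min_chunks(requested_gib, free_gib)

print(f"The request is {ratio:.2f}x the card's total capacity.")
print(f"Under the linear assumption, split the input into at least {chunks} chunks.")
```

So the input would likely have to be split (or the batch/sequence length reduced) rather than just clearing the cache or setting allocator options.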
-
Hi Dr. Gong, could I ask whether the AST model can be used for a speech enhancement task? Especially for testing, each waveform with a different length will be fed into the trained model, where the…