-
**Name of the feature**
*In general, the feature you want added should be supported by HuggingFace's [transformers](https://github.com/huggingface/transformers) library:*
- *If requesting a **model…
-
### 🚀 The feature
I'm wondering if there are any researchers out there that can search an audio stream like an mp3 and determine whether or not the track is purely spoken word versus a song or musi…
-
老师您好,请问你们有尝试过在帧级speaker embedding上面拼上使用预训练的说话人认证模型提取出的speaker embedding的相关实验吗?我这边在尝试这种做法,但实验效果一直没有达到预期
-
Hi,
first, thanks for this implementation of WaveNet!
I'm interested in performing feature extraction from raw audio files. this features will be used for different tasks such as voice activity de…
-
Hi, I'm new to Unity, how can I use this library, can I request a detailed manual?
-
Hello,
Thanks for your interesting work. I do want to check if the pre-trained checkpoints are available
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
Depending on how hackable the ncurses interface is would it be possible to have actual voice chat support?
-
Hello! This is a great repository, thank you very much @sanchit-gandhi!
We would like to use this repository in our system, but quite a few of our Word-Error Rate (WER) regression tests fail when …
-
Any noice or intense sound is classified as a human voice.
ababo updated
2 months ago