-
Downloading the pre-trained `INTERSPEECH-T-BRNN.pcl` model from [here](https://drive.google.com/drive/folders/0B7BsN5f2F1fZQnFsbzJ3TWxxMms) and then running it on the TED talk data does not yield the …
-
Hello and thank you sharing your great work, but I have some questions.
1. For song vc with Madarian, I tried train a new starganv2vc model with pretrained ASR and F0 model, but the result sound not …
-
Thanks for sharing your code!
This issue is about LogSpectralDistance calculation. Source code is:
`LSD=mean(sqrt(mean((log(CL(RangeBin,1:N))-log(NO(RangeBin,1:N))).^2)));`
It's not consistent …
-
I have tested with different kind of files where emotions are angry happy sad or calm. I am supposed to get valence and arousal values in such a way that 1st quadrant means excited/happy || 2nd quadra…
bmond updated
6 years ago
-
1. As you mention that some performance is worse than rnnoise, can you post some examples ?
2. The DNS-chanllenge dataset is the master branch's fullband data, interspeech2020 or interspeech2021 ? Th…
-
# Task Name
Respiratory Sound Classification
## Task Objective
The objective of this task is to predict if an audio of respiratory sound indicates early-stage fatal lung diseases for better d…
-
Hello
I read that the combination of a generic and a domain specific LM improves the accuracy of a speech recognition model and therefore I wanted to create such a LM. However, I would like to give…
-
We would like a keyword detection enhancement to DeepSpeech, i.e, the ability to detect a key word or phrase directly from a WAV audio file. We saw "keyword spotting" in the Meeting Notes as a potenti…
-
http://www.thereport.co.kr/news/articleView.html?idxno=6044
우울증 진단관련 연구 : 참가자에게 스트레스와 불안감을 의도적으로 유발하는 사회적 스트레스 테스트(Trier-Social Stress Test, TSST)를 활용하여, 그 과정을 통해 나온 음성 데이터를 분석
https://news.join…
-
作者你好,谢谢你的分享!最近在学习lpcnet,请问下你有相应的代码可以跑起来自己训练吗?
X-CCS updated
2 years ago