-
你好,在用audio-server-online2-nnet3 解码时,按照readme上的调用方式localhost:5010/'scp:data/wav/wav.scp' 报 如下错误
LOG (audio-server-online2-nnet3[5.2.79~2-ff2ad]:Collapse():nnet-utils.cc:770) Added 10 components, rem…
-
Everything has worked so far up until make in the ext folder aka the last command in install.sh
here is the console output:
Not building with cuda!!!
g++ -std=c++11 -I.. -isystem /home/theo/cod…
-
Hi,
I am trying to get MFA work in my docker image(for some reason it must be based on ubuntu16.04). The prebuild binaries didn't work due to glibc version. So I am trying to build those binaries fro…
-
Hi, have been using your Simulator functionality and found it quite useful. However, the augmented data I'm obtaining from it has a ton of reverb (more than I'm expecting). Still diagnosing the proble…
-
In Kaldi, in egs/tedlium/s5_r2/local/train_ted_lm.sh [or something like that], we have a script that trains the pocolm LM.
Because this was set up before we had train_lm.py, it doesn't use train_lm.py…
-
https://github.com/Sui-Siann-Dataset/Sui-Siann-Dataset/pull/152#issuecomment-465941181
下底有一句一句ê音檔,敢有看著
彼是kaldi照羅馬字切--ê
若是一校ê羅馬字差傷濟,kaldi會切uainn,羅馬字略仔改好了後ē-tàng予kaldi重切
-
I encountered a problem that affects users whose profiles are stored on the network. This problem occurs when using the function of converting video\audio to text using the Vosk-Kaldi method.
I found…
-
Such as:
- [ ] source
- [ ] duration
- [ ] language
- [x] speaker segments
- [ ] silences
- [ ] WER
- [ ] music segment count
-
Hi, I am using MFA for force alignment between phonenes and audio, I want to know nnet3 or chain model is used to train MFA from scarch? As I know than tdnn in nnet3 is better for alignment.
-
Dear Guenter,
It will be very helpful to know hardware requirements, to avoid problems with lack of RAM, HDD or GPU RAM and wasting time to trying to train using not enough powerful computer.
I un…