-
Hi there,
I am wondering what does fbank really give us in the dataloader? I went to torchaudio doc and did not find much info about what it is. Does anyone have a link to its explanation?
Thank…
-
【问题】
按照项目描述操作,运行prepare_kaldi_feats.sh后没有生成.tsv 结尾的文件,请问这个文件怎么生成?
【项目描述如下】
1、利用kaldi提取40维mfcc特征,运行脚本参考prepare_kaldi_feats.sh
可将运行脚本prepare_kaldi_feats.sh与参数设置mfcc_hires.conf置于kaldi任一egs目录下(与cmd.…
ghost updated
3 weeks ago
-
Hi, dear author, setting grammar in Vosk is very useful, so I copy UpdateGrammarFst() to Kaldi and test it with open source chain model(http://kaldi-asr.org/models/m13), but the result is bad( I set f…
-
[Complation from source](https://alphacephei.com/vosk/install#compilation-from-source)
[Dockerfile.manylinux](https://github.com/alphacep/vosk-api/blob/master/travis/Dockerfile.manylinux)
[kaldi-vos…
Tatsh updated
2 weeks ago
-
Thanks for your amazing project sherpa-ncnn and it provide an alternative excellent ASR engine to open source community and developers.
Could you implement TTS using next-generation Kaldi with [ncn…
-
如果我用librosa 进行特征抽取
# --use-energy=false # use average of log energy, not energy.
# --sample-frequency=16000 # Switchboard is sampled at 8kHz
# --num-mel-bins=40 # similar to Google's setup.
…
-
For Reproducing your issue
Please fill out the following:
Corpus structure
What language is the corpus in? Mandarin
How many files/speakers? 4
Are you using lab files or TextGrid files for inpu…
-
Not running (should not matter that its linux because all the python libs exist here too)
Reproduction:
1. Fresh out of the box (from git) clone of Tacspeak. (git clone ...)
2. python -m venv ./.ve…
-
Do you have the procedure for fine tuning the model with our own data?
Would we use this model
https://github.com/daanzu/kaldi-active-grammar/releases/download/v1.4.0/kaldi_model_daanzu_20200328_1ep…
-
感谢发布该项目,原项目的kaldi配置确实十分头疼
看到目前只支持模型inference,想请问是否考虑增加finetune功能?