-
```
(deepspeech-venv) josh@stealth:~/git/DeepSpeech/kenlm/build$ make
[ 0%] Building CXX object util/CMakeFiles/kenlm_util.dir/double-conversion/bignum-dtoa.cc.o
[ 1%] Building CXX object util/CM…
-
### 🐛 Describe the bug
I was messing around with the example code found in https://github.com/pytorch/audio/tree/main/examples/libtorchaudio/speech_recognition
I compiled using the build.sh bash f…
-
你好,我看了一下你训练的样本中,数字部分,相连的数字不用空格隔开,这是kenlm对格式的要求吗?还是有其他考量?
-
I knew when using SRILM ,we can use the -limit-vocab -vocab vocab_file to train the lm,so the number of 1-gram words will be the same with the number of words contained in a vocab.
When i use Kenlm, …
-
Hello,
I am trying to migrate my ASR model from OpenSeq2Seq decoder to Flashlight. Currently, I am using [Nemo Conformer large](https://github.com/NVIDIA/NeMo/blob/main/examples/asr/conf/conformer…
-
Hi, could you please tell me, can I merge a few large lms in ARPA format using KenLM?
I looked through existing issues, but couldn't find an answer:
* In #146 you said it can be done, but options fo…
-
Hi,
I have successful run all those steps in README and have bible.arpa bible.binary but there is no trie file
How can I generate trie? I cant find any tutorial about this
-
C++ 代码
lm::ngram::Config config;
model = new lm::ngram::Model(language_model_path);
初始话C++ 版本的 kenlm 模型报错
相同代码时而能成功 ,但常时间下是报错的
Segmentation fault (core dumped)
段错误 可能是哪儿指针存在问题,求大神救救我
-
Hello
I'm trying to download the est_republicaine corpus to train the French language model using KenLM, when I click on the link, it gives me this error "nginx error! The page you are looking for is…
-
Hi,
Perhaps, this is a naive question, but do we need to retain non-Lexicon words in KenLM file? I haven't read the KenLM paper but do backoff and smoothing require retaining other non-Lexicon word…