kenlm Search Results - Githubissues

1000+ results
for kenlm

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

kpu/kenlm #305

Computing perplexity on different sized text corpuses

Hello, I'd like to compute perplexity on different text corpuses given an ngram computed with kenlm. I found in some old issues that `--vocab_pad` param should be used with a big number in similar sit…

tomassykora updated 2 years ago
1
kpu/kenlm #285

Arbitrary token boundaries

Is there a way to do ngram estimation with custom token separation? The idea would be to get the following behavior: `Hi, this is a sentence.` -> `Hi`, `,`, `this`, `is`, `a`, `sentence`, `.` `My em…

gkucsko updated 4 years ago
1
kpu/kenlm #427

Wrong calculation of 1-gram adjusted counts?

I'm writing a Python script that mimics the behavior of lmplz. When I tested it out on a large corpus, I found the estimated probabilities differed slightly from lmplz's output. By shrinking the c…

MaigoAkisame updated 1 year ago
1
parlance/ctcdecode #205

problem with CTCBeamDecoder.decode() when using a big (.arpa…

i'm interested in using the kenlm LM to decode/score outputs of my speech recognition model. when I initiate my CTCBeamDecoder with model_path='./test.arpa', which is a pretty small .arpa file just…

aybberrada updated 1 year ago
1
tensorflow/tensorflow #76794

Multithreading is not working with teansorflow

### Issue type Bug ### Have you reproduced the bug with TensorFlow Nightly? Yes ### Source source ### TensorFlow version tensorflow==2.15.0.post1 ### Custom code Yes ### OS platform and dist…

KunduruJayasimhareddy updated 1 month ago
3
kpu/kenlm #187

Segmentation fault by estimating

I am trying to estimate language model on Raspbian. I got a segmentation fault when running `kenlm/build/bin/lmplz -o 4 --prune 0 1 2 3 --limit_vocab_file vocab.txt --interpolate_unigrams 0 lm.arpa`. …

gospodima updated 6 years ago
2
facebookresearch/fairseq #3817

Tensorboard writers are not cleared between hydra configurat…

## 🐛 Bug Tensorboard writers are not cleared between hydra configurations ### To Reproduce This problem was spotted while running training of Wav2Vec-U with default parameters.: ```PREFIX=…

prokotg updated 3 years ago
3
kpu/kenlm #423

python setup.py install error

zscwind updated 1 year ago
2
danpovey/pocolm #20

Min-counts

I'm adding a note here, although this is not really an 'issue' in the normal sense. I just checked in code that supports enforcing min-counts. This should make the process of building and pruning LM…

danpovey updated 8 years ago
4
flashlight/flashlight #669

Possible to return word/token level confidence and time stam…

### Question Dear Sirs, I am currently using the Python binding for Flashlight, working with the LexiconDecoder and KenLM classes to build a decoder for an ASR model I have. I currently call the d…

trias702 updated 3 years ago
1

上一页 1...14 15 16 17 18 19 20...100 下一页

1000+ results for kenlm

1000+ results
for kenlm