Open GoogleCodeExporter opened 9 years ago
Same problem with
mitlm64/interpolate-ngram -o 2 -v corpus_50.vocab -u true "corpus1.txt,
corpus2.txt, corpus3.txt, corpus4.txt, corpus5.txt, corpus6.txt" -op
dev_set.txt -wl mix_2.lm
-opt-alg default, so it's LBFGS.
It seems that this problem arises only with really big text corpora (>2 GB).
And backoff = nan only with words that should have "big" negative value
otherwise.
I'm using mitlm 0.4.1 in Cygwin (cygwin1.dll version 1.7.32).
Original comment by verypret...@gmail.com
on 14 Nov 2014 at 2:56
Original issue reported on code.google.com by
hu.xinhu...@gmail.com
on 10 Nov 2010 at 8:15