estimate-lm segfaults when specifying a vocab file

eddieantonio / mitlm

Automatically exported from code.google.com/p/mitlm

http://code.google.com/p/mitlm

BSD 3-Clause "New" or "Revised" License

1 stars 0 forks source link

estimate-lm segfaults when specifying a vocab file #31

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago

What steps will reproduce the problem?

Run estimate-lm -v vocab.file -t training.txt -wl lm.arpa. Where vocab.file 
contains a subset of the vocab in the LM training text.

What is the expected output? What do you see instead?

Program output:
0.000   Loading vocab vocab...
0.010   Loading corpus training.txt...
Segmentation fault

What version of the product are you using? On what operating system?

Latest SVN, ubuntu 10.04 64bit

Original issue reported on code.google.com by edohashira@gmail.com on 14 Jul 2011 at 4:34

GoogleCodeExporter commented 8 years ago

estimate-lm should be estimate-ngram

Original comment by edohashira@gmail.com on 14 Jul 2011 at 4:35

GoogleCodeExporter commented 8 years ago

[deleted comment]

GoogleCodeExporter commented 8 years ago

I'm getting the same error when executing 

./mitlm/estimate-ngram -vocab corpus.vocab-sorted-100K -text corpus.shrunk 
-write-lm corpus.lm

on Ubuntu 10.10/32

As a test, the issue still occurs when the corpus size is either 28M or 58M 
words.

I tried to test in GDB and DDD, but neither would load the executable despite 
the 
CFLAGS= g being in the makefile.

Original comment by jason.jo...@maine.edu on 29 Nov 2011 at 6:05

GoogleCodeExporter commented 8 years ago

Probably you did not run "make install". If you are not trying to load the 
installed binary, you are probably trying to load a script.

Original comment by giuliop...@gmail.com on 3 Nov 2012 at 9:45