Open GoogleCodeExporter opened 8 years ago
[deleted comment]
[deleted comment]
Nevermind. I was just being an idiot.
The segfault was being brought on by an inconsistency in the case of some
vocabulary
terms in the Vertanen model, a problem which I introduced. Specifically, some
instances of a word were upper case while other instances in other ngrams were
lowercase. Once I fixed this the problem disappeared.
Original comment by Josef.Ro...@gmail.com
on 28 Feb 2010 at 2:16
In case you ever encounter the segfault, I recommend to recompile the sources
with
"make DEBUG=1" which will turn on all assertions and the application will
probably
break on a failure of some assertion which will give you much better idea about
what
happened.
For example I found out that if there is a 3-gram "A B C", there has to be also
the
2-gram "B C" in the ARPA model, otherwise an assertion fails, which was
probably the
reason of my segfaults. It happened probably because of LM pruning with srilm.
Miso Fapso
Original comment by michal.f...@gmail.com
on 21 May 2010 at 6:29
Original issue reported on code.google.com by
Josef.Ro...@gmail.com
on 28 Feb 2010 at 1:47