dmcc closed this issue 10 years ago
Previously I had changed #define MAXSENTLEN 400
to #define MAXSENTLEN 999
in the Feature.h file. Now I am using the unmodified version of the parser available on GitHub.
I have tried running this parser on two different Linux machines.
Machine - 1 Information :
Fedora release 14 (Laughlin)
Linux version 2.6.35.6-45.fc14.x86_64 (mockbuild@x86-02.phx2.fedoraproject.org) (gcc version 4.5.1 20100924 (Red Hat 4.5.1-4) (GCC) ) #1 SMP Mon Oct 18 23:57:44 UTC 2010
LSB Version: :core-4.0-amd64:core-4.0-noarch
Machine - 2 Information :
Fedora release 18 (Spherical Cow)
Linux version 3.6.10-4.fc18.x86_64 (mockbuild@) (gcc version 4.7.2 20121109 (Red Hat 4.7.2-8) (GCC) ) #1 SMP Tue Dec 11 18:01:27 UTC 2012
LSB Version: :core-4.1-amd64:core-4.1-noarch:cxx-4.1-amd64:cxx-4.1-noarch:desktop-4.1-amd64:desktop-4.1-noarch:languages-4.1-amd64:languages-4.1-noarch:printing-4.1-amd64:printing-4.1-noarch
Please refer to the link below for the debug information (errorFile): https://www.dropbox.com/sh/kkeqm9rfik5qplc/byLd4mzmJS
I ran the following command from Java, supplying errorFile and outFile: ./parseIt, -d100, -l400, -K, -t1, -EinTagFile.txt, ../DATA/EN/, inTextFile.txt
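For anyone trying to reproduce this, here is a minimal sketch of how the command above could be launched from Java with ProcessBuilder. It is not the reporter's actual code. The class name RunParser, the helper names, and the working directory are hypothetical, and it assumes that errorFile receives the parser's stderr and outFile its stdout.

```java
import java.io.File;
import java.io.IOException;
import java.util.List;

public class RunParser {
    // Argument list copied from the command in the report; one element per argv entry.
    static List<String> parserCommand() {
        return List.of("./parseIt", "-d100", "-l400", "-K", "-t1",
                       "-EinTagFile.txt", "../DATA/EN/", "inTextFile.txt");
    }

    // Configures the process without starting it.
    // Assumption: outFile captures stdout and errorFile captures stderr.
    static ProcessBuilder configure(File workingDir, File outFile, File errorFile) {
        ProcessBuilder pb = new ProcessBuilder(parserCommand());
        pb.directory(workingDir);     // relative paths such as ../DATA/EN/ resolve from here
        pb.redirectOutput(outFile);   // parser output
        pb.redirectError(errorFile);  // debug trace and assertion messages
        return pb;
    }

    public static void main(String[] args) throws IOException, InterruptedException {
        // Hypothetical layout: run from the directory that contains parseIt.
        ProcessBuilder pb = configure(new File("."), new File("outFile"), new File("errorFile"));
        int exitCode = pb.start().waitFor();
        System.out.println("parseIt exited with code " + exitCode);
    }
}
```

Sending the two streams to separate files keeps the large -d100 trace out of the parser output, and printing the exit code shows whether the process ended abnormally.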
Thanks, the trace was useful: the issue shows up when the parser reparses a sentence without POS constraints after the POS-constrained parse fails.
The good news is that I've now been able to replicate the issue. The better news is that I've fixed it. Please reopen if this doesn't fix the issue and keep the bug reports coming!
The issue is fixed now. Thank you so much.
Continued from https://github.com/BLLIP/bllip-parser/issues/17#issuecomment-33200162
@gvjoshi25, I can't replicate this issue on Ubuntu or RHEL machines. Just to confirm, you're using an unmodified version of the latest parser and get the same assertion error every time?
If so, what *NIX distribution and version are you using? Also, can you include the output of these commands:
It might also be interesting to see the output of the parser with the debugging flag on (-d100), but note that this produces a lot of output (2.4MB for me).