pevicks / negex

Automatically exported from code.google.com/p/negex

Passing in "paragraphs" to the GenNegEx can result in array out of bounds #1

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
I wasn't sure how else to contact developers for this project...

When a paragraph is passed in instead of a single sentence, certain paragraphs
can result in an ArrayOutOfBoundsException.  This happens when the various
markers ([CONJ], [PREP], etc.) are identified multiple times, in different
sentences.  If the indexes passed to the substring() function when pulling
out the scope end up inverted (start after end), the computed length is
negative and the call fails.
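
To make the failure mode concrete, here is a minimal sketch in Java. It is not taken from the GenNegEx source: the class name `ScopeSketch`, the helper `safeScope`, and the sample paragraph are all hypothetical. It relies only on the documented behaviour of `String.substring(int, int)`, which throws `StringIndexOutOfBoundsException` when the begin index is greater than the end index, and shows one possible guard.

```java
// Hypothetical illustration; not taken from the GenNegEx source.
final class ScopeSketch {

    // Returns the text between two marker positions, or "" when the
    // positions are out of range or inverted (for example, because the
    // markers were found in different sentences of a paragraph).
    static String safeScope(String text, int start, int end) {
        if (start < 0 || end > text.length() || start > end) {
            return "";
        }
        return text.substring(start, end);
    }

    public static void main(String[] args) {
        String paragraph = "No fever but cough. Denies pain.";
        int begin = paragraph.indexOf("Denies"); // marker from sentence 2
        int end = paragraph.indexOf("but");      // marker from sentence 1

        try {
            paragraph.substring(begin, end);     // begin > end, so this throws
        } catch (StringIndexOutOfBoundsException e) {
            System.out.println("substring failed: " + e.getMessage());
        }

        // The guarded version returns an empty scope instead of throwing.
        System.out.println("[" + safeScope(paragraph, begin, end) + "]");
    }
}
```

A guard like this only hides the symptom; whether an empty scope is the right result for paragraph input is a separate question.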

While this isn't a big issue, I have questions regarding the algorithm
overall.  What is the effect of passing in an entire paragraph as opposed
to a single sentence?

Original issue reported on code.google.com by and...@na-consulting.net on 16 Jul 2009 at 7:17

GoogleCodeExporter commented 8 years ago
Thank you for your comment. 

The report is probably correct, although I did not try to reproduce the problem.

As the wrapper code shows, GenNegEx requires sentence preprocessing and sentence
boundary detection before the text is passed to it. This does not mean that we
should not fix this issue in the next release, but it does mean that GenNegEx
does not try to solve anything other than finding negations, and it assumes
correct (sentence-level) input.
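
For callers who have paragraphs rather than sentences, one way to do the splitting is sketched below. This is an assumption-laden example, not the project's wrapper code: it uses the JDK's `java.text.BreakIterator` for sentence boundary detection, and the class name `SentenceSplitSketch` and method `splitSentences` are hypothetical. The call into the negation detector is left as a comment because its signature is not given here.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical preprocessing step; not part of GenNegEx.
final class SentenceSplitSketch {

    // Splits a paragraph into trimmed, non-empty sentences using the
    // JDK's locale-aware sentence BreakIterator.
    static List<String> splitSentences(String paragraph) {
        List<String> sentences = new ArrayList<>();
        BreakIterator it = BreakIterator.getSentenceInstance(Locale.US);
        it.setText(paragraph);
        int start = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; start = end, end = it.next()) {
            String sentence = paragraph.substring(start, end).trim();
            if (!sentence.isEmpty()) {
                sentences.add(sentence);
            }
        }
        return sentences;
    }

    public static void main(String[] args) {
        String paragraph = "The patient denies pain. No fever was noted.";
        for (String sentence : splitSentences(paragraph)) {
            // Assumption: each sentence would be passed to the negation
            // detector here, one at a time.
            System.out.println(sentence);
        }
    }
}
```

The JDK iterator is a general-purpose splitter; clinical text with abbreviations may need a domain-specific sentence detector instead.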

Original comment by imre.so...@gmail.com on 2 Aug 2009 at 9:24