Open marcoagpinto opened 2 years ago
@danielnaber Hello, I am working on this issue.
After testing some cases, I think this is caused by the end of the sentence being detected incorrectly.
I found that the paragraph is divided into sentences here,
but SrxTextIterator is in a jar file, so I cannot fix it. Do you have any ideas or suggestions? Thank you.
The rules for sentence splitting are in segment.srx. I'm not sure how important this issue is, though. It doesn't look like a common error to me.
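To make the discussion concrete, here is a rough Python sketch of how SRX-style rules decide where a sentence ends: each rule pairs a "before break" pattern with an "after break" pattern, and the first rule that matches at a position decides whether to split there. The rules below are hypothetical and are not taken from segment.srx; `split_sentences` and `RULES` are names made up for this illustration, and the real splitting is done by SrxTextIterator.

```python
import re

# Hypothetical rules, NOT copied from segment.srx.
# Each rule: (is_break, before_break_regex, after_break_regex).
RULES = [
    # Exception rule: no break after a few common abbreviations.
    (False, r"\b(?:Dr|Mr|etc)\.", r"\s"),
    # Break rule: split after sentence-final punctuation if more text follows.
    (True, r"[.?!]", r"\s*\S"),
]


def split_sentences(text, rules=RULES):
    """Split text at every position where the first matching rule is a break rule."""
    sentences = []
    start = 0
    for pos in range(1, len(text)):
        for is_break, before, after in rules:
            before_ok = re.search(f"(?:{before})$", text[:pos])
            after_ok = re.match(after, text[pos:])
            if before_ok and after_ok:
                if is_break:
                    sentences.append(text[start:pos])
                    start = pos
                break  # the first matching rule wins at this position
    sentences.append(text[start:])
    return sentences


print(split_sentences("I want to know .test."))
```

With these assumed rules, the example from this issue is cut right after the first period, so any check that runs sentence by sentence would see "I want to know ." and "test." separately. Whether that is what actually happens with the real rules would need to be confirmed against segment.srx.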
Hello!
I want to know .test.
It doesn't suggest removing the space before the first “.”.
The real reason I am reporting this is that I need that rule in English so that I can port it to Portuguese and add file extensions to the antipatterns:
GLOBAL ENTITY filename_extensions
Thanks!