ghaddarAbs closed this issue 5 years ago
Also, the algorithm splits a sentence that starts with an enumeration marker containing a dot (1. XXXXXX YYYYYY ......) into 2 sentences, as follows: 1. XXXXXX YYYYYY ........
Hi!
Cases like this are handled by what are called "non-breaking prefixes": tokens that end in a dot but should not end a sentence. The file of non-breaking prefixes for French is here:
It would be very helpful if you helped us out by submitting a PR with some more such prefixes added, or at least with some test cases for the French language.
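To illustrate the idea, here is a minimal sketch of how a non-breaking prefix list can suppress a split. It is not the library's actual code: the `split_sentences` function, the tiny `NON_BREAKING_PREFIXES` set, and the rule that a bare number at the start of a sentence is treated as an enumeration marker are all assumptions made for this example.

```python
# Hypothetical, tiny prefix set for illustration only; a real per-language
# prefix file would be much longer.
NON_BREAKING_PREFIXES = {"Inc", "Dr", "M", "Mme"}


def split_sentences(text, prefixes=NON_BREAKING_PREFIXES):
    """Split text on dot-terminated tokens, except after non-breaking prefixes."""
    tokens = text.split()
    sentences = []
    current = []
    for i, token in enumerate(tokens):
        current.append(token)
        if not token.endswith("."):
            continue
        is_last = i == len(tokens) - 1
        stem = token[:-1]
        if not is_last:
            # A known prefix such as "Inc." does not end the sentence.
            if stem in prefixes:
                continue
            # Assumption: a bare number opening a sentence ("1.") is an
            # enumeration marker, not a sentence of its own.
            if len(current) == 1 and stem.isdigit():
                continue
            # Only break when the next token looks like a sentence start.
            next_char = tokens[i + 1][0]
            if not (next_char.isupper() or next_char.isdigit()):
                continue
        sentences.append(" ".join(current))
        current = []
    if current:
        sentences.append(" ".join(current))
    return sentences


print(split_sentences("1. Premier point. Acme Inc. est ici."))
# → ['1. Premier point.', 'Acme Inc. est ici.']
```

With an empty prefix set, the same input would also break after "Inc.", which is the kind of wrong split that adding prefixes (and test cases) is meant to prevent.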
Fixed in v1.4.
Great work! It seems weird to me that the algorithm couldn't split this sentence correctly:
The algorithm treats it as 2 sentences (split at Inc.), though the example isn't hard.