diasks2 / pragmatic_segmenter

Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
MIT License

Feature/english test cases #49

Closed: EliotJones closed this 6 years ago

EliotJones commented 6 years ago

Adds support for the following cases:

I'm not sure whether the 2 regexes for Chinese book quotes should be moved into the Chinese language instead. There are other types of quotes as well that may need inclusion: https://chinese.stackexchange.com/questions/517/when-are-and-and-other-brackets-used

diasks2 commented 6 years ago

Thanks so much for working on these improvements! I'll bump the version and deploy.