nipunsadvilkar / pySBD

🐍💯 pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection module that works out-of-the-box.
MIT License

Extend normal text rules #51

Closed. neogyzhu closed this 5 years ago

neogyzhu commented 5 years ago

Add the PDF rule NewLineInMiddleOfSentenceNoSpacesRule to the normal text rules so that normal (non-PDF) text can also handle cases like https://rubular.com/r/eaNwGavmdo, where a line break falls in the middle of a sentence.
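For readers unfamiliar with the rule, here is a minimal standalone sketch of the kind of substitution it is meant to perform. It uses only Python's `re` module rather than pySBD itself. The pattern (a newline directly followed by a lowercase letter, replaced by a space) and the names `NEWLINE_BEFORE_LOWERCASE` and `join_broken_lines` are assumptions made for illustration and may not match the rule's actual definition in the library.

```python
import re

# Hypothetical stand-in for NewLineInMiddleOfSentenceNoSpacesRule.
# Assumption: the rule targets a newline that is immediately followed
# by a lowercase letter, i.e. a line break inside a sentence.
NEWLINE_BEFORE_LOWERCASE = re.compile(r"\n(?=[a-z])")


def join_broken_lines(text: str) -> str:
    """Replace each newline that precedes a lowercase letter with a space."""
    return NEWLINE_BEFORE_LOWERCASE.sub(" ", text)


sample = "This sentence is broken\nin the middle.\nNext sentence starts here."
print(join_broken_lines(sample))
```

In this sketch the first newline is replaced because it is followed by a lowercase "i", while the second is kept because the next line starts with an uppercase letter and so looks like a new sentence.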

neogyzhu commented 5 years ago

Will find a better way to do this in a follow-up PR.