nipunsadvilkar / pySBD

🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
MIT License
813 stars 84 forks source link

Starting period edge case #34

Closed dakinggg closed 5 years ago

dakinggg commented 5 years ago

Thanks for fixing all the bugs so fast! I've got another edge case for you. Sometimes when the input only contains space separated periods, the first one is removed:

>>> segmenter.segment('.')
[]
>>> segmenter.segment('..')
['..']
>>> segmenter.segment('...')
['...']
>>> segmenter.segment('. .')
['. .']
>>> segmenter.segment('. . .')
['. .']
>>>