nipunsadvilkar / pySBD

🐍💯pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence boundary detection that works out-of-the-box.
MIT License
801 stars 83 forks source link

How to separate sentences when there is no punctuation? #124

Closed asasas234 closed 1 year ago

asasas234 commented 1 year ago

for example:

how are you fine thank you and you

I hope to be able to divide the above words into 3 sentences.

  1. how are you
  2. fine thank you
  3. and you
kongyurui commented 1 year ago

Probably need some sort of machine learning module to do that kind of sentence breaking. pySBD uses heuristics based on punctuation.

nipunsadvilkar commented 1 year ago

Yes pySBD is rules based and heuristics module so wont work on use-cases where there not all any punctuation.