Pragmatic Segmenter is a rule-based sentence boundary detection gem that works out-of-the-box across many languages.
MIT License
549
stars
55
forks
source link
Sentence segmentation is not working as per the golden rule for sentence "At 5 a.m. Mr. Smith went to the bank. He left the bank at 6 P.M. Mr. Smith then went to the store." #82
Sentence segmentation is not working as per the golden rule for sentence "At 5 a.m. Mr. Smith went to the bank. He left the bank at 6 P.M. Mr. Smith then went to the store."
It is creating four sentences
'At 5 a.m. ',
'Mr. Smith went to the bank. ',
'He left the bank at 6 P.M. ',
'Mr. Smith then went to the store.'
Instead, as per the golden rule, it should have created three sentences
"At 5 a.m. Mr. Smith went to the bank."
"He left the bank at 6 P.M."
"Mr. Smith then went to the store."
Sentence segmentation is not working as per the golden rule for sentence "At 5 a.m. Mr. Smith went to the bank. He left the bank at 6 P.M. Mr. Smith then went to the store."
It is creating four sentences 'At 5 a.m. ', 'Mr. Smith went to the bank. ', 'He left the bank at 6 P.M. ', 'Mr. Smith then went to the store.'
Instead, as per the golden rule, it should have created three sentences "At 5 a.m. Mr. Smith went to the bank." "He left the bank at 6 P.M." "Mr. Smith then went to the store."