uhh-lt / ethiopicmodels

Different semantic models for Amharic
17 stars 9 forks source link

sentence segmenter has issues #1

Closed abinew closed 2 years ago

abinew commented 2 years ago

The following code has an error

from amseg.amharicSegmenter import AmharicSegmenter
sent_punct = []
word_punct = []
segmenter = AmharicSegmenter(sent_punct,word_punct)
words = segmenter.amharic_tokenizer("እአበበ በሶ በላ።")
sentences = segmenter.t("እአበበ በሶ በላ። ከበደ ጆንያ፤ ተሸከመ፡!ለምን?"))
seyyaw commented 2 years ago

fixed -- see here