eklem / stopword-sami

Sami stopword lists for natural language processing. Examples on use could be search engines, machine learning and chatbots.
MIT License
1 stars 0 forks source link

Do calculation again #31

Open eklem opened 5 months ago

eklem commented 5 months ago

Do the calculation from scratch. Check if errors in stopwordiness goes away. Seems "O" and "D" should not be stopwords for Northern Sami

eklem commented 5 months ago

Same error. So an error in either words-n-numbers or stopword-trainer. Easiest is to check with words-n-numbers first, since an error there for sure is caused by the library itself. Split the text into words in an array and count all the Os and Ds. Shouldn't be many.