eklem / stopword-sami

Sami stopword lists for natural language processing. Examples on use could be search engines, machine learning and chatbots.
MIT License
1 stars 0 forks source link

Manually create redlists for the different languages #12

Closed eklem closed 2 years ago

eklem commented 2 years ago

To use each time a crawl has been done and stopword-trainer is to be used

eklem commented 2 years ago

Redlist and cutoff is the manual part of the work. Add as files to datasets/-folder

eklem commented 2 years ago

Just to get started on the manual work: Dictionary search engine for Northern Sami, Lule Sami and South Sami, that has translations to Norwegian. https://nb.glosbe.com/sma/nb/jeemie

eklem commented 2 years ago

Added three files to redlists-folder, all with empty arrays for now.