quanteda / stopwords

Multilingual Stopword Lists in R
http://stopwords.quanteda.io
Other
113 stars 9 forks source link

There is a problem with the cs or Czech list #41

Closed vikram-rawat closed 2 years ago

vikram-rawat commented 2 years ago

I am not a native speaker of slovak or czech and so I can't read it properly but I am sure... this dictionary is not translated into UTF-8 by any means

"ä\u008dau"      "ä\u008dtrnã¡ct" "ä\u008dtyå™i" 

same thing happens with sk

"veä\u008der"  "ä\u008dtyå™i" 

Please fix the encoding issue.

code used is

  stopwords::stopwords(
    language = "sk",
    source = "stopwords-iso"
  )