nltk / nltk_data

NLTK Data
1.4k stars 1.03k forks source link

Please rename Slovene stopwords to Slovenian #195

Open PrimozGodec opened 1 year ago

PrimozGodec commented 1 year ago

Hi, during language refactoring of the orange3-text module, I noticed that NLTK use Slovene as a key to Slovenian stop words. I suggest remaining it to Slovenian. The reasons are the following:

I know there is a bit of confusion, but since two terms exist for Slovenian, Slovenian is definitely more common. I wanted to make a pull request but didn't find where stopwords are stored.

PrimozGodec commented 5 months ago

Any news on this one? Can you please let me know where is it defined, and I can propose a pull request?