explosion / spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python
https://spacy.io
MIT License
30.15k stars 4.4k forks source link

Kyrgyz language stopwords #12632

Closed GrossmanLess closed 1 year ago

GrossmanLess commented 1 year ago

I'm from Kyrgyzstan, work on an article about Kyrgyz language stopwords. I'd like to change the list of Kyrgyz stopwords in SpyCy lib. I'm a beginner in contributing to GitHub. What should I do to complete my contribution?

Which page or section is this issue related to?

https://github.com/explosion/spaCy/blob/master/spacy/lang/ky/stop_words.py

adrianeboyd commented 1 year ago

Hi, we're planning to remove default stop word lists from spacy in the near future, so we're not currently accepting any PRs that modify stop word lists.

Stop word lists are task-specific and we've realized that it's not possible for us to maintain stop word lists in a sensible way for such a wide range of languages. Stop words will still be supported in general by spacy and you'll still be able to add custom stop words to your own pipeline, but spacy's default lists would be empty.

Here's how to customize stop words for your own pipeline in spacy v3: https://spacy.io/usage/linguistic-features#language-subclass

github-actions[bot] commented 1 year ago

This issue has been automatically closed because it was answered and there was no follow-up discussion.

github-actions[bot] commented 1 year ago

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.