Adding an Option for Removing Turkish Stop Words During Preprocessing

GlobalMaksimum / sadedegel

A General Purpose NLP library for Turkish

MIT License

93 stars 15 forks

Adding a text file of Turkish stop words, along with an option to remove them during text preprocessing, would be useful. It would also bring sadedegel closer to state-of-the-art NLP libraries for English.

I found an existing collection of Turkish stop words at https://github.com/ahmetax/trstop. We could use the stop word text file from that repository, then change the code to give users an option to remove stop words during the preprocessing stage.
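To make the proposal concrete, here is a rough sketch of what an opt-in stop word filter could look like. It does not use sadedegel's actual API: the names `tr_lower`, `load_stopwords`, `remove_stopwords`, and the `enabled` flag are hypothetical, and the small in-memory list stands in for the trstop file, which I assume has one word per line. Turkish casing is handled explicitly because Python's default `str.lower()` does not map `I` to `ı`.

```python
from typing import Iterable, List, Set


def tr_lower(text: str) -> str:
    """Lowercase with Turkish rules: 'I' -> 'ı' and 'İ' -> 'i'."""
    return text.replace("I", "ı").replace("İ", "i").lower()


def load_stopwords(lines: Iterable[str]) -> Set[str]:
    """Build a stop word set from an iterable of lines (hypothetical helper).

    Assumes one word per line; blank lines are skipped.
    """
    return {tr_lower(line.strip()) for line in lines if line.strip()}


def remove_stopwords(
    tokens: Iterable[str], stopwords: Set[str], enabled: bool = True
) -> List[str]:
    """Drop tokens found in `stopwords`; return them unchanged if disabled."""
    if not enabled:
        return list(tokens)
    return [t for t in tokens if tr_lower(t) not in stopwords]


# Tiny illustrative sample, NOT the real trstop list.
SAMPLE_STOPWORDS = ["ve", "bir", "bu", "için", "ile"]

stopwords = load_stopwords(SAMPLE_STOPWORDS)
tokens = "Bu kitap bir arkadaş için ve İstanbul ile ilgili".split()
filtered = remove_stopwords(tokens, stopwords)
print(filtered)
```

In practice the sample list would be replaced by something like `load_stopwords(open(path, encoding="utf-8"))`, and keeping the option off by default would leave the current preprocessing behavior unchanged for existing users.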

Adding an Option for Removing Turkish Stop Words During Preprocessing #280