htrc / torchlite-backend

Backend API service for Torchlite web dashboard
2 stars 0 forks source link

implement stopword filter #8

Open cwulfman opened 1 year ago

cwulfman commented 1 year ago

Use NLTK English stopword list in initial implementation.

Then make it possible to choose from other stopword lists from a database of stopword lists.

jswatsch commented 2 months ago

These are the stopwords lists we are using: https://github.com/nltk/nltk_data/blob/gh-pages/packages/corpora/stopwords.zip.