melaniewalsh / Intro-Cultural-Analytics

Introduction to Cultural Analytics & Python, course website and online textbook powered by Jupyter Book
https://melaniewalsh.github.io/Intro-Cultural-Analytics
GNU General Public License v3.0

Add non-English stopwords for multilingual text analysis #15

Open melaniewalsh opened 3 years ago

melaniewalsh commented 3 years ago

We need to add information about how to use non-English stopwords for topic modeling and TF-IDF.

igorbrigadir commented 3 years ago

I've used https://github.com/stopwords-iso/stopwords-iso in the past. It has stopword lists for a bunch of languages, in case you need them.
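
For anyone landing on this issue, here is a rough sketch of how a list like that might be applied, not a tested recipe for the textbook. It assumes the repository's combined JSON file maps ISO 639-1 language codes to lists of words. The tiny `SAMPLE_JSON` string and the helper names `load_stopwords` and `remove_stopwords` are made up for illustration, so that the example runs without downloading anything.

```python
import json
import re

# Illustrative stand-in for the stopwords-iso JSON file.
# Assumed layout: {"<ISO 639-1 code>": ["word", ...], ...}. The real lists are much longer.
SAMPLE_JSON = '{"es": ["el", "la", "de", "y", "en"], "de": ["der", "die", "und", "das"]}'


def load_stopwords(raw_json, lang):
    """Return the stopword set for one language code (empty set if the code is missing)."""
    data = json.loads(raw_json)
    return set(data.get(lang, []))


def remove_stopwords(text, stopwords):
    """Lowercase the text, split it into word tokens, and drop any token in the stopword set."""
    tokens = re.findall(r"\w+", text.lower())
    return [token for token in tokens if token not in stopwords]


spanish_stopwords = load_stopwords(SAMPLE_JSON, "es")
filtered = remove_stopwords("El gato y la luna en la casa de Ana", spanish_stopwords)
print(filtered)  # ['gato', 'luna', 'casa', 'ana']
```

For TF-IDF, the same words could probably be passed as a plain list to the `stop_words` parameter of scikit-learn's `TfidfVectorizer`, which accepts a list of strings in addition to the built-in `"english"` option.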

melaniewalsh commented 2 years ago

Thank you, @igorbrigadir! I just got around to checking this out, and it looks great. I'm looping in @quinnanya to make sure she knows about this resource, too.