inpho / vsm

Vector Space Model Framework developed for InPhO
http://inpho.github.io/vsm

nltk stoplist to take languages other than English #76

Closed: doori closed this issue 10 years ago

doori commented 10 years ago

Currently, in apply_stoplist, the nltk_stop parameter takes a boolean and, when enabled, adds only the NLTK stoplist for English.

This could be changed so that nltk_stop defaults to 'english' and also accepts another language name, a list of languages, or None, for more flexible corpus cleanup.
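A rough sketch of how such a parameter might be handled, for illustration only. It is not the code in vsm: the helper names (normalize_languages, nltk_words, build_stoplist) and the add_stop and loader arguments are hypothetical. The only real library call is nltk.corpus.stopwords.words, which needs the NLTK 'stopwords' corpus data to be installed.

```python
def normalize_languages(nltk_stop='english'):
    """Return a list of language names from a str, an iterable of str, or None."""
    if nltk_stop is None:
        return []
    if isinstance(nltk_stop, str):
        return [nltk_stop]
    return list(nltk_stop)


def nltk_words(language):
    """Load one NLTK stoplist; requires nltk and its 'stopwords' corpus data."""
    # Imported lazily so the rest of the sketch can be used without nltk.
    from nltk.corpus import stopwords
    return stopwords.words(language)


def build_stoplist(nltk_stop='english', add_stop=None, loader=nltk_words):
    """Combine the stoplists for the requested languages with any extra words.

    `loader` is a hypothetical hook mapping a language name to a word list;
    it defaults to the NLTK-backed function above.
    """
    words = set(add_stop or [])
    for language in normalize_languages(nltk_stop):
        words.update(loader(language))
    return sorted(words)
```

With this shape, build_stoplist('german'), build_stoplist(['english', 'german']) and build_stoplist(None) would all be accepted, and None simply skips the NLTK lists. Passing a different loader keeps the language handling testable without the corpus data installed.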

doori commented 10 years ago

Pushed to the master branch in vsm/extensions/corpuscleanup.py.