koaning / tokenwiser

Bag of, not words, but tricks!
https://koaning.github.io/tokenwiser/
Apache License 2.0

WordListFeaturizer #56

Open koaning opened 3 years ago

koaning commented 3 years ago

We could have a count vectorizer read in a text file of words and vectorize all of those words upfront. Given a list of "bad words", I imagine this might help bootstrap a profanity detector.
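To make the idea concrete, here is a minimal sketch using only the standard library. The class name `WordListFeaturizer` is taken from the issue title, but everything else is an assumption for illustration: the one-word-per-line file format, the lowercasing, the `\w+` tokenization, and the `fit`/`transform` methods that only imitate scikit-learn conventions. It is not the tokenwiser API. The words in the example file are placeholders.

```python
import re
import tempfile
from collections import Counter
from pathlib import Path


class WordListFeaturizer:
    """Hypothetical sketch: count occurrences of words from a fixed word list."""

    def __init__(self, path):
        # Assumed file format: one word per line. Blank lines are skipped
        # and duplicates are dropped while keeping the original order.
        lines = Path(path).read_text(encoding="utf-8").splitlines()
        words = [line.strip().lower() for line in lines if line.strip()]
        self.vocabulary_ = {w: i for i, w in enumerate(dict.fromkeys(words))}

    def fit(self, X, y=None):
        # Nothing to learn: the vocabulary is fixed by the file.
        return self

    def transform(self, X):
        # One row per text, one column per word in the list.
        rows = []
        for text in X:
            counts = Counter(re.findall(r"\w+", text.lower()))
            rows.append([counts[w] for w in self.vocabulary_])
        return rows


# Usage with a throwaway word list (placeholder words, not a real profanity list).
with tempfile.TemporaryDirectory() as tmp:
    word_file = Path(tmp) / "bad_words.txt"
    word_file.write_text("darn\nheck\n", encoding="utf-8")
    featurizer = WordListFeaturizer(word_file)
    features = featurizer.fit(None).transform(["Heck, what the heck!", "all good here"])
```

With scikit-learn available, a similar effect can probably be had by reading the file into a list and passing it to `CountVectorizer(vocabulary=...)`, which fixes the columns upfront instead of learning them from the data.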