Closed AmeliaMN closed 10 years ago
@AmeliaMN Do we make changes to ProcessText.R to solve this?
This is one of the more amorphous tasks... I don't have a great idea of how to do it. Maybe we need to split ProcessText into two functions: one that does all the other processing and one that handles stop words? That would make it clearer that the new parameter (called something like "list", maybe) refers specifically to stop words. E.g.:
RemoveWords(text, list="english")
RemoveWords(text, list="SMART")
RemoveWords(text, list=c("the", "a", "my"))
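To make the proposed split concrete, here is a minimal sketch of what a standalone RemoveWords might look like. This is not the code in ProcessText.R: the whitespace tokenization, the case-insensitive matching, and the use of `tm::stopwords()` for the named lists are all assumptions made for illustration.

```r
# Hypothetical sketch of RemoveWords(); not the actual package code.
# `list` is either the name of a built-in stop word list ("english" or
# "SMART", both assumed to come from tm::stopwords) or a character
# vector of custom words to remove.
RemoveWords <- function(text, list = "english") {
  builtin <- c("english", "SMART")
  if (length(list) == 1 && list %in% builtin) {
    # Named list: look it up in the tm package (only needed on this path)
    words <- tm::stopwords(list)
  } else {
    # Anything else is treated as a custom vector of stop words
    words <- list
  }
  # Split each string on whitespace, drop stop words (case-insensitive),
  # and paste the remaining tokens back together
  vapply(strsplit(text, "\\s+"), function(tokens) {
    keep <- !(tolower(tokens) %in% tolower(words))
    paste(tokens[keep], collapse = " ")
  }, character(1))
}

RemoveWords("the cat sat on my mat", list = c("the", "a", "my"))
```

One wrinkle with overloading a single parameter this way: a custom list consisting only of the word "english" would be read as the name of the built-in list, so a separate argument for custom words might be cleaner.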
Rename the 'RemoveWords' function (which I haven't pushed yet, sorry Amelia) to UpdateStopwords.
E.g., it could switch the list from "english" to "SMART" and/or modify the list itself (pass a vector of words that you want removed from the stop word list, or create your own stop word list).
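A rough sketch of how an UpdateStopwords function along these lines could behave. The function has not been pushed, so the argument names `base`, `add`, and `remove` are hypothetical, as is the use of `tm::stopwords()` to look up the named lists.

```r
# Hypothetical sketch of UpdateStopwords(); argument names are assumptions.
# base:   "english" or "SMART" (looked up via tm::stopwords), or a
#         character vector to use as your own starting list
# add:    extra words to treat as stop words
# remove: words to take out of the stop word list
UpdateStopwords <- function(base = "english",
                            add = character(0),
                            remove = character(0)) {
  if (length(base) == 1 && base %in% c("english", "SMART")) {
    # Named list: fetch it from the tm package
    base <- tm::stopwords(base)
  }
  # Drop the unwanted words first, then append any new ones (no duplicates)
  union(setdiff(tolower(base), tolower(remove)), tolower(add))
}

# Start from a custom list, drop "my", add "of"
UpdateStopwords(base = c("the", "a", "my"), add = "of", remove = "my")
```

The returned vector could then be passed to whatever function does the actual removal, which keeps list management separate from text processing.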