Open parkmichelle opened 5 years ago
Look for instances where splitting text with .split(" "), tokenize with nltk instead or figure out how to tokenize with more than whitespace
ie need to split by punctuation too?
May also need to use porterstemmer, I think this might be a creative extension but not sure
Look for instances where splitting text with .split(" "), tokenize with nltk instead or figure out how to tokenize with more than whitespace