learntextvis / textkit

Command line tool for manipulating and analyzing text
MIT License
28 stars 6 forks source link

Add lemmatization capabilities #36

Closed vlandham closed 8 years ago

vlandham commented 8 years ago

Input: Token document Output: lemmatized Tokens

vlandham commented 8 years ago

this is now part of the tokens2stem. We might want to pull this out into its own command.

Also, we need to check if the wordnet lemmatizer needs to download files (i think it does).