Jinho has a lemmatizer for English that is driven off of WordNet data. It
needs to be run ahead of his dependency parser for best results (see issue
#126). Since it is functionally separate from the dependency parser it makes
sense to have a separate wrapper for this tool rather than coupling it to the
dependency parser wrapper.
We need a place in the type system for the resulting lemmas. I suggest that we
extend the Token type to have a feature called lemma of type String.
Original issue reported on code.google.com by phi...@ogren.info on 1 Feb 2011 at 7:32
Original issue reported on code.google.com by
phi...@ogren.info
on 1 Feb 2011 at 7:32