Open odanoburu opened 5 years ago
How should the input for this tool normally be processed? We need it to be at least tokenized and lemmatized; identifying MWEs (multiword expressions) would also be of interest.
I don't know how well NLTK supports this, but since you mentioned it, it could be an alternative. Another option is FreeLing.
We need to test this: take a corpus and produce some output so we can discuss further.
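To make that discussion concrete, here is a rough sketch of the kind of output we might aim for: tokens, then lemmas, then MWEs merged into single units. It is only a toy stand-in and uses neither NLTK nor FreeLing; the `LEMMAS` table, the `MWES` list, and all function names are hypothetical, made up for illustration. A real test would take lemmas and MWE candidates from whichever tool we pick.

```python
import re

# Hypothetical toy resources, for illustration only. A real run would get
# lemmas and MWE entries from a tool such as NLTK or FreeLing.
LEMMAS = {"kicked": "kick", "dogs": "dog", "was": "be", "buckets": "bucket"}
MWES = {("kick", "the", "bucket"), ("hot", "dog")}
MAX_MWE_LEN = max(len(mwe) for mwe in MWES)


def tokenize(text):
    """Split text into lowercase word tokens and single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", text.lower())


def lemmatize(tokens):
    """Map each token to its lemma; unknown tokens are left unchanged."""
    return [LEMMAS.get(token, token) for token in tokens]


def merge_mwes(lemmas):
    """Greedy longest match: join each known MWE into one underscore-joined unit."""
    merged = []
    i = 0
    while i < len(lemmas):
        # Try the longest candidate first, down to length 2.
        for n in range(min(MAX_MWE_LEN, len(lemmas) - i), 1, -1):
            candidate = tuple(lemmas[i:i + n])
            if candidate in MWES:
                merged.append("_".join(candidate))
                i += n
                break
        else:
            # No MWE starts here; keep the single lemma.
            merged.append(lemmas[i])
            i += 1
    return merged


def preprocess(text):
    """Tokenize, lemmatize, then merge MWEs."""
    return merge_mwes(lemmatize(tokenize(text)))


print(preprocess("The dogs kicked the bucket."))
# ['the', 'dog', 'kick_the_bucket', '.']
```

Matching MWEs on lemmas rather than surface forms means inflected variants such as "kicked the bucket" are still caught, which is one reason to lemmatize before MWE identification.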