-
Before upgrading to NLP4J, we got this parse:
```
"[ 0, 2]NSUBJ(nsubj) D[0,2](We) G[3,7](need)",
"[ 3, 7]ROOT(root) D[3,7](need) G[3,7](need)",
"[ 8, 9]DET(det) D[8,9](a) G[35,43](sentence)",
"…
-
When looking at [this file](https://github.com/emorynlp/nlp4j/blob/master/md/supplements/data-format.md), I noticed that the lemmatized column in the example doesn't match with the word ("founder" lem…
-
re: clir/clearnlp#9
Dr. Choi,
I'm using ClearNLP at the moment, but needed to be able to preserve/mark up the original text. The easiest way to do this for me was to use another tokenizer that genera…
-
Good practice in tokenization is to make the tokenization pipeline information preserving, in the sense that you can always recover the original form of the input document, including details of whites…
cbrew updated
9 years ago