Open GoogleCodeExporter opened 9 years ago
ClearNLP version 3.1.0 is released.
A new component for named entity recognition is added, which shows
state-of-the-art accuracy on both CoNLL'03 and OntoNotes data (a paper
describing our approach is under submission).
All statistical models are upgraded; the part-of-speech tagger and the
dependency parser use features extracted from distributional semantics, which
give more robust results on unseen data.
The dependency parser is trained on data from our new dependency conversion,
which adapts many concepts from the universal dependency structures and
introduces some new useful labels such as "dative".
Components for semantic role labeling and coreference resolution will be added
in June. Please let me know if you have any questions or suggestions about ClearNLP.
Thank you!
best,
Jinho
Original comment by richard.eckart
on 30 Apr 2015 at 8:36
ClearNLP 3.1.1 is released.
Word embedding lexicons are removed from the global lexica; they added little
accuracy but consumed a lot of RAM. Furthermore, the gazetteers for
named entity recognition are now separated from the global lexica for better
modularization (see models for more details).
The core dictionary is updated; some past-tense verbs that were wrongly
recognized as base verbs are now fixed.
The named entity recognition model is updated.
See pom.xml for all updated Maven dependencies.
We'll be making many more updates over the summer, so please stay tuned.
Thank you!
best,
Jinho
Original comment by richard.eckart
on 8 May 2015 at 1:13
Original issue reported on code.google.com by
richard.eckart
on 24 Mar 2015 at 10:53