dkpro / dkpro-core

Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
https://dkpro.github.io/dkpro-core
Other
196 stars 67 forks source link

Integrate NLP4J (successor to ClearNLP) #792

Closed reckart closed 7 years ago

reckart commented 8 years ago

Integrate NLP4J, the successor to ClearNLP.

This version supports tokenization, part-of-speech tagging, morphological analysis, named entity recognition, and dependency parsing. The next release (March, 2016) will include supports for semantic role labeling and sentiment analysis, and the following release (April, 2016) will include supports for coreference resolution. NLP4J makes it easy to train your own model. Please see how to train for more details about the training process. Calling the decoding API is easier than ever. See NLPDemo for more details. The biggest difference between NLP4J and ClearNLP is in machine learning. NLP4J is capable of updating existing models with new training data, which is useful for domain adaptation. We also started implementing a deep learning package although we realized that the GPU support for Java is pretty limited and without a good GPU support, deep learning would make everything much slower. Please let us know if you'd like to contribute for this project. One could consider the NLP4J project is a more stabilized version of ClearNLP. I have been using this package for the NLP course I teach, and my students (including undergrads) were able to develop new NLP components without much effort using the built-in APIs in NLP4J. We are preparing a tutorial for developing NLP components using NLP4J. We do not expect our tools would work perfectly out of box. We now have a good team working on this project. Please let us know if you'd like to collaborate so we can make this project more robust for you.


reckart commented 8 years ago

Upgrade to the new NLP4J 1.1.0: https://groups.google.com/d/msg/emorynlp/S679fMz4ABs/HlafKrKfAQAJ

mjunsilo commented 7 years ago

What is the status for integration of NLP4J coreference resolution? Its mentioned in the description, but is not present in the feature list above or the current master branch.

reckart commented 7 years ago

I believe that coref has not even been integrated upstream in NLP4J. I also think that NLP4J is no longer being developed and deprecated in favor of another framework that Jinho has under development. But I am not entirely sure...

mjunsilo commented 7 years ago

Thanks for the hint. We were currently just trying out and comparing various alternative components.