stanfordnlp / stanza

Stanford NLP Python library for tokenization, sentence segmentation, NER, and parsing of many human languages
https://stanfordnlp.github.io/stanza/

Differences with Java #163

Closed statsmaths closed 5 years ago

statsmaths commented 5 years ago

There seem to be a number of features currently missing in the Python implementation relative to the Java one. For example:

I was wondering if there were plans to incorporate these into Python, and if so whether there is a timeline for doing so.

yuhaozhang commented 5 years ago

You are right that Java CoreNLP currently has many functions that the Python toolkit lacks. In the next release (coming soon), we will add NER support for a range of major languages in the Python toolkit. A sentiment analysis annotator is also on our roadmap. For other functionality such as quote attribution or coreference resolution, we do not have any plans at this point.

Note that if you want to stay in the Python world but still get CoreNLP annotations, you can always use our Python CoreNLP client interface.
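For readers who want to try the client route, here is a minimal sketch rather than an official recipe. It assumes the current `stanza` package, where the client lives in `stanza.server` (at the time of this thread the package was named `stanfordnlp`, so the import was `stanfordnlp.server`), and it assumes a local Java CoreNLP installation that the `CORENLP_HOME` environment variable points to. The helper names, the annotator list, the timeout, and the memory setting are illustrative choices, not values taken from this thread.

```python
import os

# Illustrative annotator list: CoreNLP's "coref" annotator is one of the
# features mentioned above as not planned for the native Python pipeline.
COREFERENCE_ANNOTATORS = ["tokenize", "ssplit", "pos", "lemma", "ner", "parse", "coref"]


def corenlp_is_configured(env=None):
    """Return True if CORENLP_HOME is set to a non-empty value (hypothetical helper)."""
    env = os.environ if env is None else env
    return bool(env.get("CORENLP_HOME"))


def annotate_with_corenlp(text, annotators=COREFERENCE_ANNOTATORS):
    """Start a CoreNLP server in the background, annotate text, then shut it down.

    Assumes stanza is installed and Java CoreNLP is available locally.
    """
    # Imported lazily so this module still loads where stanza is not installed.
    from stanza.server import CoreNLPClient

    # timeout is in milliseconds; memory is the Java heap size for the server.
    with CoreNLPClient(annotators=annotators, timeout=30000, memory="4G") as client:
        return client.annotate(text)


if __name__ == "__main__":
    if corenlp_is_configured():
        ann = annotate_with_corenlp("Chris wrote a parser. He likes it.")
        # The result is a protobuf Document; tokens carry the NER tags.
        for sentence in ann.sentence:
            for token in sentence.token:
                print(token.word, token.ner)
    else:
        print("Set CORENLP_HOME to your CoreNLP directory first.")
```

The client starts a Java server process behind the scenes, so the first call is slow and memory use is governed by the Java side rather than by Python.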

statsmaths commented 5 years ago

Perfect, thanks for the update. Very helpful. Will look forward to the next release with NER!