spraakbanken / sparv-pipeline

Språkbanken's text analysis tool
https://spraakbanken.gu.se/sparv
MIT License
25 stars 6 forks source link

Adding Stanza for other languages than Swedish #114

Open heatherleaf opened 3 years ago

heatherleaf commented 3 years ago

The Stanford NLP documentation recommends changing to Stanza when working in Python:

We are actively developing a Python package called Stanza, with state-of-the-art NLP performance enabled by deep learning. Besides, this package also includes an API for starting and making requests to a Stanford CoreNLP server. It is the recommended way to use Stanford CoreNLP in Python. (https://stanfordnlp.github.io/CoreNLP/other-languages.html#python)

And since Stanza comes with pre-trained models for many languages, I suggest we move to Stanza as the default annotator.

anne17 commented 9 months ago

We started working on this a long time ago but never really finished. Things left to do: