dkpro / dkpro-core

Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
https://dkpro.github.io/dkpro-core
Other
196 stars 67 forks source link

Add Stockholm University Swedish POS Tagger #991

Closed munterkalmsteiner closed 1 year ago

munterkalmsteiner commented 7 years ago

Swedish POS Tagger that uses the Stockholm-Umeå Corpus and tag set (http://www.ling.su.se/english/nlp/tools/stagger). Could then be used with the Maltparser and model which uses the same corpus and tag set. Mostly implemented and currently testing.

reckart commented 7 years ago

What would be the name/artifactId of the new module? I would suggest dkpro-core-stockholm?

Please mind that for new components, we now use the package name org.dkpro.core.XXX, and the Maven GroupId in new modules should be org.dkpro.core.

munterkalmsteiner commented 7 years ago

I'd rather use the upstream name "stagger" rather than "stockholm". Then, I've followed the naming scheme from stanford NLP, i.e.:

I can change that to:

Would that work?

reckart commented 7 years ago

That's fine.

reckart commented 5 years ago

@munterkalmsteiner I wonder if you somehow forgot about this? ;)

munterkalmsteiner commented 5 years ago

@reckart Indeed. I'll check the email you sent me on December 1 2016 with instructions on how to package the model. I think this was the last thing that was missing. PR will come in the coming days/weeks.