dkpro / dkpro-core

Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
https://dkpro.github.io/dkpro-core
Other
196 stars 67 forks source link

[TreeTagger] Intern POS values and lemmas to save memory #18

Closed reckart closed 9 years ago

reckart commented 9 years ago
Per default the TreeTagger wrapper should intern POS values and lemmas to save memory.
It should be an option however, as somebody may not want to incur the additional overhead.

Original issue reported on code.google.com by richard.eckart on 2011-05-29 08:53:01

reckart commented 9 years ago
Added the parameter PARAM_INTERN_STRINGS to TreeTaggerTT4JBase - default is "true"
Enabled the hugeDocumentTest for TreeTaggerPosLemma.
Added another test for English.
Upgrade to modern uimaFIT API in TreeTaggerPosLemmaTest.

Original issue reported on code.google.com by richard.eckart on 2011-05-29 08:56:14

reckart commented 9 years ago
Fixed.
---
rev 107

Original issue reported on code.google.com by richard.eckart on 2011-10-02 15:11:13