johann-petrak / gateplugin-StringAnnotation

!!! OLD/OUTDATED, use https://github.com/GateNLP/gateplugin-StringAnnotation
GNU Lesser General Public License v2.1
7 stars 1 forks source link

Add parameter/setting to handle latin characters #5

Closed johann-petrak closed 6 years ago

johann-petrak commented 10 years ago

Add a parameter that will handle text/gazetteers with UTF-8 characters which have mainly one-byte encodings so that the UTF-16 characters used by Java will first be converted to bytes. This should make it possible to reduce the memory requirements for the trie to nearly half of what we need now.

johann-petrak commented 6 years ago

Not worth the effort, WONTFIX.