dkpro / dkpro-core

Collection of software components for natural language processing (NLP) based on the Apache UIMA framework.
https://dkpro.github.io/dkpro-core

[TreeTagger] Add a standard mapping for non-supported language #15

Closed reckart closed 9 years ago

reckart commented 9 years ago
At the moment, languages that are supported by the TreeTagger but do not yet have a
mapping to the DKPro type system cannot be used with the TreeTagger AE.

We should add a standard mapping for unsupported languages that maps all
POS tags to a general-purpose annotation (I think "O" (= Other) is currently used
for non-mappable types). The original POS values can then be retrieved from the PosValue feature
of the O annotations.

This should not be seen as a replacement for a language mapping, but as a workaround
for new languages until a mapping to the DKPro type system has been created.

Original issue reported on code.google.com by oliver.ferschke on 2011-05-09 22:12:21

reckart commented 9 years ago
If no mapping file can be found, a default mapping of "*" to O is set up.
---
Fixed in revision 387. 

Original issue reported on code.google.com by richard.eckart on 2011-10-24 18:23:37
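
The fallback described in this thread can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code from revision 387: the class FallbackTagMapping, its resolve method, and the use of null to stand for "no mapping file found" are all hypothetical names chosen for the example. It also assumes that a loaded mapping carries its own "*" entry for tags it does not list.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical helper for illustration; not part of DKPro Core.
final class FallbackTagMapping {
    static final String WILDCARD = "*"; // key that matches any tag
    static final String OTHER = "O";    // general-purpose "Other" type

    private final Map<String, String> mapping = new HashMap<>();

    // 'loaded' is the content of a language mapping file,
    // or null if no mapping file could be found (assumption of this sketch).
    FallbackTagMapping(Map<String, String> loaded) {
        if (loaded == null) {
            // No mapping available: map every tag to O.
            mapping.put(WILDCARD, OTHER);
        } else {
            mapping.putAll(loaded);
        }
    }

    // Returns the mapped type name for a tagger-specific POS tag,
    // falling back to the wildcard entry for unlisted tags.
    String resolve(String posTag) {
        String type = mapping.get(posTag);
        return type != null ? type : mapping.get(WILDCARD);
    }
}
```

With no mapping file, every tag resolves to O, so the tagger can still run on a new language. The original tag string is not lost as long as the caller stores it alongside the resolved type, which is the role the PosValue feature plays in the proposal above.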