mideind / GreynirEngine

A fast, efficient natural language processing engine for Icelandic.
https://greynir.is
Other
60 stars 10 forks source link

Add POS tags from BÍN to variants returned by Reynir #3

Closed vthorsteinsson closed 6 years ago

vthorsteinsson commented 6 years ago

BÍN has POS tags such as present and past tense (NT, ÞT), and attached definite article (gr), which are not always returned in Reynir's variant lists since they are not significant for the parse as such. However this may well be useful information for Reynir clients. An augmentation feature should be added to Reynir that adds any significant missing BÍN tags to terminal variants before they are returned from Reynir, for instance in the _Sentence.terminals property.

vthorsteinsson commented 6 years ago

Added in version 1.1.4. The SimpleTree class now has an all_variants property in addition to the variants property. all_variants is augmented with variants (features) from BÍN. The _Sentence.terminals property now includes the all_variants list in its tuples, instead of the variants list.