morfologik / morfologik-stemming

Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.
BSD 3-Clause "New" or "Revised" License
187 stars 44 forks source link

"taić" as a non-reflective, imprefective form included twice #66

Closed mikolajz closed 8 years ago

mikolajz commented 8 years ago

E.g., the line is: taić taić verb:inf:imperf.perf:nonrefl+verb:inf:imperf:refl.nonrefl

As I understand, this expands to: taić taić verb:inf:imperf:nonrefl taić taić verb:inf:perf:nonrefl taić taić verb:inf:imperf:refl taić taić verb:inf:imperf:nonrefl

and the fact that the line "taić taić verb:inf:imperf:nonrefl" happens twice can confuse some programs.

dweiss commented 8 years ago

But we don't do this expansion in Morfologik, do we? And there is no easy way to "fold it in" since you'd generate verb:inf:perf:refl which isn't on the list.