qwaider / morphisto

Automatically exported from code.google.com/p/morphisto
0 stars 0 forks source link

analysis of compounds, where at least the last element is a known word. #60

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
I have words like Eichdorffschule, Leunabrücke, etc... The last part is a 
valid German substantive, that could be inflected, like Eichdorffschulen, 
Leunabrücken. Were it possible, that morphisto not simply says, not known, but 
it would pass back the dictionary form, for example: Leunabrücken-> 
Leunabrücke Eichdorffschulen->Eichdorfschule ? Or even better, it could pass 
back the analyzed word divided into parts like: Eichdorff<NN>schule<+NN>... 

Original issue reported on code.google.com by eleonor...@gmx.net on 1 Sep 2011 at 10:14

GoogleCodeExporter commented 9 years ago
you probably *could* implement this through wildcards (allowing ".*" as prefix 
in smor.fst). However, such a change conflicts with major applications for a 
morphological analyzer (e.g. error correction), and should not be added to the 
trunk.

Original comment by rico.sen...@googlemail.com on 5 Sep 2011 at 9:05

GoogleCodeExporter commented 9 years ago
There is a discussion in the wiki about this.
Mainline morphisto would pass back for XXXbrücke:
{XXX}-brücke
{} indicates: unknown word.
the {} mechanism is already implemented in morphisto.

Original comment by eleonor...@gmx.net on 5 Sep 2011 at 9:13

GoogleCodeExporter commented 9 years ago

Original comment by wuerz...@gmail.com on 8 Sep 2011 at 3:15

GoogleCodeExporter commented 9 years ago

Original comment by wuerz...@gmail.com on 8 Sep 2011 at 3:15