0xCAB / morphisto

Automatically exported from code.google.com/p/morphisto

Missing prefixes #61

Open GoogleCodeExporter opened 8 years ago

GoogleCodeExporter commented 8 years ago
If a linguistic problem:
What wordform makes the faulty analysis occur?
"Dioxid" and "Trioxid" are unknown.

Please add:
<Pref_Stems>tri<PREF><ADJ><fremd,klassisch,nativ>
<Pref_Stems>tri<PREF><NN><fremd,klassisch,nativ>
<Pref_Stems>di<PREF><ADJ><fremd,klassisch,nativ>
<Pref_Stems>di<PREF><NN><fremd,klassisch,nativ>

to the lexicon.
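
A minimal sketch of how entries like the ones above could be checked before they go into the lexicon. The entry shape is an assumption inferred only from the lines quoted in this issue (`<Pref_Stems>`, the form, `<PREF>`, a category tag, a comma-separated origin tag); `parse_prefix_entry` is a hypothetical helper, not part of morphisto.

```python
import re

# Assumed shape, inferred only from the entries quoted in this issue:
# <Pref_Stems>FORM<PREF><CATEGORY><origin[,origin...]>
ENTRY_RE = re.compile(r"^<Pref_Stems>([^<>]+)<PREF><([^<>]+)><([^<>]+)>$")


def parse_prefix_entry(line):
    """Return (form, category, origins), or None if the line does not match."""
    match = ENTRY_RE.match(line.strip())
    if match is None:
        return None
    form, category, origins = match.groups()
    return form, category, origins.split(",")


if __name__ == "__main__":
    # A well-formed entry is split into its three parts.
    print(parse_prefix_entry("<Pref_Stems>tri<PREF><ADJ><fremd,klassisch,nativ>"))
    # A hypothetical entry with a missing "<" before PREF is rejected.
    print(parse_prefix_entry("<Pref_Stems>triPREF><ADJ><nativ>"))
```

Running every proposed line through such a check would flag entries with a dropped angle bracket before they are committed.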

Original issue reported on code.google.com by eleonor...@gmx.net on 1 Sep 2011 at 10:15

GoogleCodeExporter commented 8 years ago
<Pref_Stems>pan<PREF><ADJ><nativ>
<Pref_Stems>pan<PREF><NN><nativ>
<Pref_Stems>erdbeer<PREF><NN><nativ>
Erdbeerschnitten
<Pref_Stems>leer<PREF><V><nativ>
leerlaufen

Original comment by eleonor...@gmx.net on 2 Sep 2011 at 3:37

GoogleCodeExporter commented 8 years ago
<Pref_Stems>latz<PREF><NN><nativ>

Original comment by eleonor...@gmx.net on 2 Sep 2011 at 4:00

GoogleCodeExporter commented 8 years ago
<Pref_Stems>aufrecht<PREF><V><nativ>

Original comment by eleonor...@gmx.net on 5 Sep 2011 at 6:17

GoogleCodeExporter commented 8 years ago
<Pref_Stems>er<PREF><V><nativ>
erfinden
<Pref_Stems>bodybuilding<PREF><ADJ><fremd>
bodybuildingorientiert
<Pref_Stems>anders<PREF><ADJ><nativ>
andersfarbig
<Pref_Stems>dumpf<PREF><ADJ><nativ>
dumpfdräuend
<Pref_Stems>geo<PREF><ADJ><nativ>
geowissenschaftlich
<Pref_Stems>grün<PREF><ADJ><nativ>
<Pref_Stems>blau<PREF><ADJ><nativ>
<Pref_Stems>rot<PREF><ADJ><nativ>
<Pref_Stems>schwarz<PREF><ADJ><nativ>
<Pref_Stems>weiß<PREF><ADJ><nativ>
<Pref_Stems>metallic<PREF><ADJ><nativ>

Original comment by eleonor...@gmx.net on 7 Sep 2011 at 6:07

GoogleCodeExporter commented 8 years ago
<Pref_Stems>schluss<PREF><V><nativ>
<Pref_Stems>schluß<PREF><V><nativ>
schlussfolgern
<Pref_Stems>sudeten<PREF><NN><nativ>
<Pref_Stems>sudeten<PREF><ADJ><nativ>
<Pref_Stems>ziegen<PREF><ADJ><nativ>
<Pref_Stems>unten<PREF><ADJ><nativ>
untenstehend
<Pref_Stems>un<PREF><V><nativ>
unhinterfragt
<Pref_Stems>teer<PREF><ADJ><nativ>
<Pref_Stems>straf<PREF><ADJ><nativ>
<Pref_Stems>stakkato<PREF><ADJ><nativ>
stakkatoartig

Original comment by eleonor...@gmx.net on 8 Sep 2011 at 7:50

GoogleCodeExporter commented 8 years ago
Most of the proposals are content words, so their analysis should be covered by
the composition mechanism. For example, "dumpf":
> dumpfklingend
dumpf<ADJ>klingen<V><PPres><SUFF><+ADJ><Pos><Adv>
dumpf<ADJ>klingen<V><PPres><SUFF><+ADJ><Pos><Pred>

I accept "er" (already added to branches/kmw/src/prefixes_new.xml), as well as
"di", "pan" and "tri" (which possibly should only have origin "fremd"). For
"erdbeer", I propose adding a KomposStem that maps to "Erdbeere" (cf. "Bundes"
in branches/kmw/src/basestems_new.xml). The same applies to "unten" and
"straf". "Teer" and "Stakkato" are also missing as nouns. For the adjectives, I
have to look into deko.fst. It might be feasible to have an "adj+verb"
composition rule.
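
To make the analysis output above easier to read, here is a rough sketch that splits an analysis string into surface text and tags. It assumes only that tags are delimited by angle brackets, as in the two lines quoted for "dumpfklingend"; `split_analysis` is a hypothetical helper, not part of morphisto.

```python
import re

# Either a tag in angle brackets or a run of plain text between tags.
TOKEN_RE = re.compile(r"<([^<>]+)>|([^<>]+)")


def split_analysis(analysis):
    """Split an analysis string into ("tag", name) and ("text", string) pairs."""
    tokens = []
    for tag, text in TOKEN_RE.findall(analysis):
        tokens.append(("tag", tag) if tag else ("text", text))
    return tokens


if __name__ == "__main__":
    for kind, value in split_analysis(
        "dumpf<ADJ>klingen<V><PPres><SUFF><+ADJ><Pos><Adv>"
    ):
        print(kind, value)
```

Seen this way, "dumpf" shows up as an ordinary adjective stem followed by the verb stem "klingen", i.e. as a compound part rather than a prefix.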

Original comment by wuerz...@gmail.com on 8 Sep 2011 at 3:29

GoogleCodeExporter commented 8 years ago
<Pref_Stems>poly<PREF><ADJ><nativ>
<Pref_Stems>radon<PREF><ADJ><nativ>
radonstärkste
<Pref_Stems>nimmer<PREF><ADJ><nativ>

Original comment by eleonor...@gmx.net on 8 Sep 2011 at 3:32

GoogleCodeExporter commented 8 years ago
<Pref_Stems>genauso<PREF><ADJ><nativ>
genausowenig

Original comment by eleonor...@gmx.net on 9 Sep 2011 at 4:31

GoogleCodeExporter commented 8 years ago
<Pref_Stems>teil<PREF><ADJ><nativ>
teilweise
<Pref_Stems>schieß<PREF><NN><nativ>
Schießbude

Original comment by eleonor...@gmx.net on 12 Sep 2011 at 4:30