qwaider / morphisto

Automatically exported from code.google.com/p/morphisto
0 stars 0 forks source link

Stems starting with the prefix "ge" should not occur. #15

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Many non-regular verbs have wrong past participle entries (VVPP):

<BaseStem> <Lemma>flecht</Lemma> <Stem>geflocht</Stem> 
<Pos>V</Pos><Origin>nativ</Origin> <InfClass>VVPP-en</InfClass> 
<Frequency>16</Frequency> </BaseStem>

This results in wrong analyses as of "vergeflochten" (which should have no 
analysis). The correct lexicon entry would be

<BaseStem><MorphMarker>ge</MorphMarker>
<Lemma>flecht</Lemma> <Stem>flocht</Stem> <Pos>V</Pos><Origin>nativ</Origin> 
<InfClass>VVPP-en</InfClass> <Frequency>16</Frequency> </BaseStem>

Work on this issue has begun in the 'kmw' branch.

Original issue reported on code.google.com by wuerz...@gmail.com on 20 May 2011 at 2:10

GoogleCodeExporter commented 9 years ago
All entries with stems starting with "ge" have been moved to 
'basestems_waste.xml'. Corrected entries have been added to 'basestems_new.xml' 
which is included into the build in the 'kmw' branch.

@CWRSimon: Could you please verify this strategy? We could carry this over to 
the trunk.

Original comment by wuerz...@gmail.com on 26 May 2011 at 4:13

GoogleCodeExporter commented 9 years ago
Push

Original comment by wuerz...@gmail.com on 9 Sep 2011 at 2:17