dkpro / dkpro-uby

Framework for creating and accessing UBY resources – sense-linked lexical resources in standard UBY-LMF format
https://dkpro.github.io/dkpro-uby

IMSLex-Subcat: Convert auxiliaries #146

Open chmeyer opened 9 years ago

chmeyer commented 9 years ago

The auxiliaries are currently missing. I have already prepared some code to extract the auxiliary from the original IMSLex files, but we need to decide how to represent verbs that take both "haben" and "sein". AFAIK, they are specially tagged with "-variant"(?). Currently, EAuxiliary has no enum value for verbs that take both auxiliaries. Solutions may be

judithek commented 9 years ago

auxiliary and subcat frame together constitute a large part of a verb sense, so they must not be separated.

haben-variant should be represented as haben, sein-variant as sein

the aux. information goes into the LexemeProperty which is linked to Sense
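
The mapping proposed above could be sketched roughly as follows. This is only an illustration, not the actual converter code: `AuxiliaryMapper`, its nested `Aux` enum, and `fromImsLexTag` are hypothetical names, `Aux` merely stands in for UBY's EAuxiliary, and the tag strings "haben-variant" and "sein-variant" are assumed from this thread rather than checked against the IMSLex files.

```java
import java.util.Locale;

public class AuxiliaryMapper {

    // Hypothetical stand-in for UBY's EAuxiliary; the real enum is not used here.
    public enum Aux { HABEN, SEIN }

    private static final String VARIANT_SUFFIX = "-variant";

    // Maps an auxiliary tag to a single auxiliary value.
    // Assumption: tags look like "haben", "sein", "haben-variant", "sein-variant".
    public static Aux fromImsLexTag(String tag) {
        if (tag == null) {
            throw new IllegalArgumentException("tag must not be null");
        }
        String normalized = tag.trim().toLowerCase(Locale.ROOT);
        // Drop the "-variant" marker so that both forms map to the same value.
        if (normalized.endsWith(VARIANT_SUFFIX)) {
            normalized = normalized.substring(0, normalized.length() - VARIANT_SUFFIX.length());
        }
        switch (normalized) {
            case "haben":
                return Aux.HABEN;
            case "sein":
                return Aux.SEIN;
            default:
                throw new IllegalArgumentException("Unknown auxiliary tag: " + tag);
        }
    }

    public static void main(String[] args) {
        System.out.println(fromImsLexTag("haben-variant"));
        System.out.println(fromImsLexTag("sein-variant"));
    }
}
```

With this approach no additional enum value for "both auxiliaries" would be needed, since each tagged entry resolves to exactly one auxiliary that can be stored in the LexemeProperty linked to the Sense.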

judithek commented 9 years ago

so it would be your second suggestion. See also the SubcategorizationFrame class, which models that subcat frame and aux. belong together:

// LexemeProperty of this SubcategorizationFrame
@VarType(type = EVarType.CHILD)
private LexemeProperty lexemeProperty;