iljackb / Mixtepec_Mixtec

Mostly XML (TEI) markup of Mixtepec-Mixtec Language resources

Alvarado Dictionary: the content in italics in the Jansen and Perez source is POS #73

Open iljackb opened 5 years ago

iljackb commented 5 years ago

In the Jansen and Perez source, the content in italics inside the definitions/translations is generally part of speech info and should ideally be encoded in TEI as such, e.g.:

dzo ee nuu quaha: arreo (adverbio)

This is distinct, however, from information not in italics in the same source, which can be of various types (domain, context, meronymy, semantic info, etc.), e.g.:

dzo indita tenduhua nica: a las 5 (madrugada)

dzo ndita caa quay: extender todo el cuerpo (mandando dicen)

dzo quaha nda dzavua na saa canu: hasta tantos serán (respondiendo)

dzo neña yoquitu yotuvui iyondi: vivir pobre (por metáfora)
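
As a rough sketch of how the two cases could be told apart when generating the TEI, the snippet below assumes a hypothetical plain-text export in which italic text is wrapped in `*asterisks*` (the source itself does not prescribe this convention). The function name `encode_entry`, the choice of `<usg type="hint">` as a catch-all for non-italic parentheticals, and the `xml:lang="es"` on the translation are all assumptions for illustration, not the encoding actually used in this repo.

```python
import re
import xml.etree.ElementTree as ET

# Qualified name for xml:lang in ElementTree.
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"

# "lemma: gloss (parenthetical)"; the parenthetical may be *italic* (assumed markup).
LINE_RE = re.compile(
    r"^(?P<lemma>[^:]+):\s*(?P<gloss>.*?)\s*\((?P<paren>[^()]*)\)\s*$"
)


def encode_entry(line):
    """Build a TEI-style <entry> element from one dictionary line (hypothetical helper)."""
    m = LINE_RE.match(line)
    if m is None:
        raise ValueError(f"unrecognised entry line: {line!r}")

    lemma = m.group("lemma").strip()
    gloss = m.group("gloss").strip()
    paren = m.group("paren").strip()
    # Assumption: italics are marked by surrounding asterisks in the input.
    is_italic = len(paren) > 2 and paren.startswith("*") and paren.endswith("*")

    entry = ET.Element("entry")
    form = ET.SubElement(entry, "form", {"type": "lemma"})
    ET.SubElement(form, "orth").text = lemma

    if is_italic:
        # Italic parenthetical: treat as part of speech.
        gram_grp = ET.SubElement(entry, "gramGrp")
        ET.SubElement(gram_grp, "pos").text = paren.strip("*")

    sense = ET.SubElement(entry, "sense")
    cit = ET.SubElement(sense, "cit", {"type": "translation", XML_LANG: "es"})
    ET.SubElement(cit, "quote").text = gloss

    if not is_italic:
        # Non-italic parenthetical: left as an unclassified usage hint,
        # to be refined by hand (domain, context, meronymy, ...).
        ET.SubElement(sense, "usg", {"type": "hint"}).text = paren

    return entry


if __name__ == "__main__":
    for raw in (
        "dzo ee nuu quaha: arreo (*adverbio*)",
        "dzo indita tenduhua nica: a las 5 (madrugada)",
    ):
        print(ET.tostring(encode_entry(raw), encoding="unicode"))
```

The non-italic parentheticals would still need manual classification afterwards, since the script only separates them from POS and does not try to guess whether a given note is a domain, a context, or a figurative-use label.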