IAHLT / UD_Hebrew

Hebrew Universal Dependencies Treebank
Other
2 stars 2 forks source link

Deprel of Emphatic Pronouns #37

Open NathanD38 opened 3 years ago

NathanD38 commented 3 years ago

@amir-zeldes

The deprel between a nominal and an emphatic pronoun in the current HTB is nmod. This is the standard behavior of the parser as well. To my understanding, this deprel cannot directly modify a nominal without the mediation of a case-assigned ADP.

None of the English corpora actually assigns the feature PronType=Emp, even in cases where it clearly functions as such. The English corpus, GUM, assigns nmod:npmod to emphatic pronouns, but assigns the feature PronType=Prs.

In the guidelines, I erroneously equated emphatic pronouns to determiners such as oto, the same one, and kulo, the entire/whole one, i.e., det-assigned PRON. I do not think that emphatic pronouns function, either syntactically or semantically, in the same manner as the aforementioned determiners.

Should we revise the current HTB, and current and following batches, so that emphatic pronouns receive deprel nmod:npmod and PronType=Emp?

amir-zeldes commented 3 years ago

Yes, that's exactly right - because we restrict nmod to PP-modifiers, we should use nmod:npmod for these. BTW that's not a universal guideline, languages that don't adopt an npmod-style subtype often use nmod without case - it's kind of a luxury to be able to reserve it just for PPs, but I think it's worth doing because it makes things much clearer. PronType=Emp is nice to have too, and probably not hard to add also for English (GUM largely tries to do the same thing as EWT with the morphology, so we just haven't gotten to it yet)

NathanD38 commented 3 years ago

@amir-zeldes

Thank you! I'll add that to the revision list. :)