UniversalDependencies / docs

Universal Dependencies online documentation
http://universaldependencies.org/
Apache License 2.0
274 stars 249 forks source link

zh/PRON: add PronType=Ind/Int #977

Closed kirianguiller closed 1 year ago

kirianguiller commented 1 year ago

Hi everyone,

Our team in Paris Nanterre University (Modyco laboratory) is working on a SUD Mandarin treebank (link here). Altough we only annotated 60% of the treebank, we started working on the automatic grew conversion rules (written here) in prevision of the next UD release at the end of the month.

In our treebank, we were annotating both wh-question words and wh-indefinite words as PRON with a PronType feat of Int or Ind. This annotation is different from what has been done in the UD_Chinese-HK treebank, as the indefinite pronouns were annotated as DET.

We think the UD Mandarin treebank would benefit from this annotation as it would help queries based on the PronType.

We are willing to listen to any advice on the situation.

PS : as a sidenote, in the UD_Chinese-HK treebank, we can find some errors of annotation for these wh-question/wh-indefinite. If we follow the current UD Mandarin guidelines, we can still find, thanks to this grew cluster query some inconsistencies. Indeed, some wh-question words were annotated as DET (wh-indefinite) (for instance this sentence), it occurs for around 50% of the 23 annotated wh-indefinite. We can help to correct these.

kirianguiller commented 1 year ago

Thank you !