jonorthwash / ud-annotatrix

GNU General Public License v3.0

Punctuation arcs don't automatically get punct label #425

Open ftyers opened 3 years ago

ftyers commented 3 years ago

There used to be some defaults regarding punctuation and other things; now we don't have them.

jonorthwash commented 2 years ago

Is this the one that tried to guess attachments for punctuation, with a keyboard shortcut, written by @mr-martian in like 2018, or is it something else?

What do you want the behaviour to be? Should any token whose content is `.`, `,`, `;`, `!`, `?`, etc. just get a POS of PUNCT and maybe a deprel of punct, even if not attached? (Edit: each of those only if not already annotated; we don't want it overriding some other standard or sub-deprel or something.)
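To make the question concrete, here is a rough Python sketch of that behaviour. It is not ud-annotatrix code: the `Token` class, the `UNSET` values, and both function names are made up for illustration, and treating "punctuation" as "every character is in a Unicode `P*` category" is an assumption, not something decided in this thread.

```python
import unicodedata
from dataclasses import dataclass
from typing import Optional

# Values treated as "not annotated yet" (assumption: None, empty, or CoNLL-U "_").
UNSET = (None, "", "_")


@dataclass
class Token:
    """Hypothetical stand-in for an annotated token."""
    form: str
    upos: Optional[str] = None
    deprel: Optional[str] = None


def is_punctuation(form: str) -> bool:
    """True if the form is non-empty and every character is Unicode punctuation."""
    return bool(form) and all(
        unicodedata.category(ch).startswith("P") for ch in form
    )


def apply_punct_defaults(token: Token) -> Token:
    """Fill in PUNCT / punct on punctuation tokens, never overriding existing values."""
    if not is_punctuation(token.form):
        return token
    if token.upos in UNSET:
        token.upos = "PUNCT"
    if token.deprel in UNSET:
        token.deprel = "punct"
    return token
```

Each field is checked separately, so a token that already carries some other tag or a sub-deprel keeps it, and only the missing field is filled in.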

nschneid commented 2 years ago

Currently, if a word is tagged PUNCT, creating an arc into it will default to the punct label. But it might be nice for the tagging to be automatic as well.
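For reference, the current arc behaviour amounts to something like the sketch below. This is only an illustration in Python, not the project's actual code; `default_arc_label` and its parameters are hypothetical names, and the rule that an existing label wins over the default is an assumption.

```python
from typing import Optional


def default_arc_label(
    dependent_upos: Optional[str],
    existing_deprel: Optional[str] = None,
) -> Optional[str]:
    """Label to pre-fill on a new arc pointing at a token with the given UPOS."""
    # Keep whatever label is already there (assumption: "_" and "" mean unset).
    if existing_deprel not in (None, "", "_"):
        return existing_deprel
    # A PUNCT-tagged dependent defaults to the punct relation.
    if dependent_upos == "PUNCT":
        return "punct"
    # Otherwise there is no default; the annotator picks the label.
    return None
```

Making the tagging automatic would mean the PUNCT tag is already in place by the time an arc is drawn, so this default would apply without the annotator tagging the token first.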