Open nschneid opened 1 month ago
Can you give a bit more explanation on what ExpPos means in this case or how the external deprel will be represented?
External POS: https://universaldependencies.org/en/feat/ExtPos.html
For example, "instead" is individually an ADV, but where it attaches as mark
, it is due to the fixed expression "instead of" acting as SCONJ. So in those cases it would receive ExtPos=SCONJ
.
(BTW this has also been implemented in GUM)
BTW2: In v2.14, most of the treebanks that use ExtPos
put the ExtPos
feature in the FEATS
column.
This includes the SUD native corpora, English-EWT, UD_Portuguese-Bosque and UD_Portuguese-GSD.
For consistency, it would be nice to have the same policy in others such as English-GUM.
For EWT I've just moved it to MISC following @dan-zeman's statement that FEATS should be reserved for properties of individual words, not larger units.
Yes, it's in MISC in GUM for the same reason.
New issue about standardizing ExtPos at the universal level: UniversalDependencies/docs#1037
Implemented in the above commit.
I've made some small updates to the English fixed
docs: see #317.
One question:
case
, but I'm wondering if this is CCONJ-like, cf. "rather than"I think "as opposed to" is like "rather than"—its coordination vs. prepositional function depends on context.
cc
(directly connects contrasted elements of like categories)case
The Core Group decided it would be a good idea for treebanks to specify how each
fixed
expression functions externally viaExtPos
in the MISC column.This is already implemented for a few expressions in EWT. We might as well expand to all of them. If the external deprel is correct, it can be used to infer the ExtPos (which is one of ADV, ADP, CCONJ SCONJ).