Open AngledLuffa opened 12 months ago
Examples 2 and 3 are ordinal number words, so according to the UD guidelines they should be XPOS=CD with NumForm=Word|NumType=Ord.
IIRC, NNP is used in the XPOS for compatibility with PTB. In this case, example 3 should match example 2. This gives a conflicting XPOS candidate (CD or NNP).
The Cambridge Dictionary classifies the ordinals as determiners (but notes that another determiner like "the" or "a" can precede the ordinal):
However, Wiktionary classifies them as adjectives:
Wikipedia doesn't mention ordinals as adjectives in the adjective order page:
But Wikipedia seems to agree with the Cambridge Dictionary and not Wiktionary on that page:
Determiners and postdeterminers—articles, numerals, and other limiters (e.g. three blind mice)—come before attributive adjectives in English.
Ordinal numbers should be ADJ: https://universaldependencies.org/u/pos/ADJ.html
So First_ADJ Opium War? NNP or JJ for the xpos?
My hunch is NNP
World and War both NNP? It looks very weird having them be PROPN but NN.
In "World War I", definitely "World" and "War" are NNP. I would lean that way also for "First World War", and that seems to be consistent with OntoNotes.
NNP or JJ for the xpos?
My hunch is NNP
Worth pointing out that in GUM, the 2002 World Cup gets the tag CD (not NNP). However, it might be considered not actually part of the name, I suppose.
... although they later annotate "Instruments for Research into Second Languages (IRIS)" and "Second City" with Second_NNP
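For anyone who wants to see how often this kind of mismatch occurs, here is a minimal sketch of a consistency check over CoNLL-U text. It is not part of any UD or GUM tooling: the `ORDINALS` word list, the function names, and the `SAMPLE` sentences are all made up for illustration (the sample is not copied from any treebank). It collects the XPOS tags seen for each ordinal word form and reports the forms that carry more than one tag.

```python
from collections import defaultdict

# Hypothetical word list for illustration; extend as needed.
ORDINALS = {"first", "second", "third", "fourth", "fifth"}


def ordinal_xpos_counts(conllu_text):
    """Map each lowercased ordinal form to a dict of XPOS tag -> count."""
    counts = defaultdict(lambda: defaultdict(int))
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence breaks) and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        tok_id, form, xpos = cols[0], cols[1], cols[4]
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in tok_id or "." in tok_id:
            continue
        if form.lower() in ORDINALS:
            counts[form.lower()][xpos] += 1
    return {word: dict(tags) for word, tags in counts.items()}


def inconsistent(counts):
    """Keep only the forms that were seen with more than one XPOS tag."""
    return {word: tags for word, tags in counts.items() if len(tags) > 1}


# Made-up sentences, only to exercise the functions above.
SAMPLE = "\n".join([
    "# text = the Second City",
    "1\tthe\tthe\tDET\tDT\t_\t3\tdet\t_\t_",
    "2\tSecond\tSecond\tADJ\tNNP\t_\t3\tamod\t_\t_",
    "3\tCity\tCity\tPROPN\tNNP\t_\t0\troot\t_\t_",
    "",
    "# text = the second attempt",
    "1\tthe\tthe\tDET\tDT\t_\t3\tdet\t_\t_",
    "2\tsecond\tsecond\tADJ\tJJ\t_\t3\tamod\t_\t_",
    "3\tattempt\tattempt\tNOUN\tNN\t_\t0\troot\t_\t_",
])

if __name__ == "__main__":
    print(inconsistent(ordinal_xpos_counts(SAMPLE)))
```

A form showing up with several tags is not necessarily an error (an ordinal inside a name versus an ordinary modifier may legitimately differ), so the output is best read as a list of cases to review by hand.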
How do the changes here look?
2002 World Cup gets the tag CD (not NNP)
I think that's canon, let me know if someone wants to argue it's not?
It seems weird that we have
but then also have
and