UniversalDependencies / UD_English-EWT

English data
Creative Commons Attribution Share Alike 4.0 International
199 stars 42 forks source link

"a couple NOUN" #405

Closed nschneid closed 1 year ago

nschneid commented 1 year ago

The quantity expression "a couple" can immediately precede the noun quantified—like "a few", "a little". (Contrast "a lot", which requires "of".)

It is handled inconsistently:

nmod seems incorrect as there is no preposition.

In #170 we decided to stick with the status quo for "a few"/"a little"—treat both article and adjective as headed by the noun, with det and amod.

But "couple" is a noun. Should it be considered a NUM? (And if so, shouldn't other approximate quantities—"lot", "bunch", "handful", etc. also qualify?)

If "couple" and other approximate quantities are not NUM, then it seems odd to attach them as nummod. Currently "couple" is among very few items attaching as nummod with no NUM or NumType: EWT, GUM

Syntagmatically, "couple" and "few" seem to fill the nummod slot—they cannot cooccur with another nummod, and "a" cannot coocurr with a definite article:

Another option is fixed for all these "a" + quantity noun expressions that occur prenominally—treating them as multiword DETs or NUMs. But that doesn't solve "the couple weeks since December". And at the time of #170 we decided to avoid rocking the boat.

Is compound the least bad solution, by analogy to amod for "few"/"little"?

amir-zeldes commented 1 year ago

I see the issue. I think NUM/nummod is not right, because if "few" isn't a NUM, the neither is "couple" IMO, since it's not literally two, it's non-exact (and "many" etc. are also not NUM).

If we accept that it remains NOUN and is similar to "(a) few" (which is adjectival), then I think the better label here would be nmod:npmod, which is used elsewhere for extent modifiers. Since nouns normally take articles, I would attach the "a" as usual, to "couple", and consider it an extent modifer NP.

nschneid commented 1 year ago

nmod:npmod for MEASURE + NOUN (without "of") appears to be the policy for the multiplicative noun "times" as in "it is eight times the price", as well as occasional percentages ("Genius is one percent inspiration", "it is 100% a scam"). EWT, GUM

So I guess extending this to "a couple NOUN" makes sense in practice. It's a bit linguistically unsatisfying as a theory of how English noun phrases are formed, because some of these nmod:npmods and amods ("a few") substitute for determiners, but we'd need to do a bigger overhaul of the treatment of det to account for that.

nschneid commented 1 year ago

Implemented nmod:npmod for EWT

amir-zeldes commented 1 year ago

OK, GUM matches upstream