UniversalDependencies / UD_English-EWT

English data
Creative Commons Attribution Share Alike 4.0 International
199 stars 42 forks source link

Annotation errors in relative clauses incl. partitive elaborations on nominals #503

Open xiulinyang opened 9 months ago

xiulinyang commented 9 months ago

The following sentences are not relative clauses but the words in bold are marked with acl:relcl.

dependency relations are wrong for the tokens in bold.

not sure if they are correct or not/sentences are ungrammatical

nschneid commented 9 months ago

Thanks! Agree on most of the ones that should be acl.

  • The best there is in service

This one is tricky. "The best there is" is an idiom. I think it is a relative clause, and "best" stands for roughly "the best option". It's hard to make a paraphrase with "there is" that preserves the semantics/information structure though.

  • There must be a reason so many people like it there.

I think this is correct as expl. "It" is nonreferential.

  • Mike one of owners was awesome, he explained the detailed plan, and executed on time, I am always going use them and refer them to many friends I can because of the great job they did me.

Where is the issue here?

  • The dangers implicit in any attack upon that nation, however, seem to significantly offset whatever gains could be made in the so-called "War on Terror."

"whatever gains could be made (in the...)" is a free relative. The only issue I see in the basic tree is that PronType=Int should be Rel.

  • Overall they aren't very knowledge about the type of games are on the market.

Yeah, a relativizer is missing.

  • Your application is still evaluated on a number of criteria, not all of them as "cast in steel" as the points.

Grammatical ("cast in steel" is being used like an adjective: cf. "not all of them as rigid as the points" or "not all of which are as rigid as the points"). Will have to think about this but my hunch is it should be acl not acl:relcl.

In this day its rare to find such wonderful people who CARE , Not the kind of want to make cash

Yeah this is ungrammatical. Maybe "of" was supposed to be "who"? "of" is attached as dep suggesting incoherence.

xiulinyang commented 9 months ago

Thanks for the clarification! I think me in the sentence below should be iobj?

Mike one of owners was awesome, he explained the detailed plan, and executed on time, I am always going use them and refer them to many friends I can because of the great job they did me.

nschneid commented 9 months ago

Yes, I would agree with the iobj annotation though it sounds a bit nonstandard to my ear.

nschneid commented 9 months ago

Most of the ones that should be acl can be identified because they are non-finite clauses, or are finite but marked as a whole by a word like "after". Generalizing to queries:

nschneid commented 9 months ago

("the chance that" and vocative "my friends" were addressed in 495a62f)

nschneid commented 9 months ago

Ones I'm not sure about—@amir-zeldes?

Also, there are these items that strike me as syntactic blends: incorporated-particle-adjectives "ongoing" and "upcoming" coerced into acting like verbs. (I would have said "happening"/"going on" and "coming up" respectively.) Because these don't have bare verb lemmas I'm leaving them as ADJ and changing acl:relcl to amod.

amir-zeldes commented 9 months ago

I think parataxis - the use of a PRP in "a number of criteria, not all of them as "cast in steel" as the points: same" seems telling, since English generally avoids subj/obj resumptives in acls.

customers already on DA: would go with nmod, same as "customers on DA" plus an adjunct.

Ongoing etc. look like extraposed adj, like "a table similar to a chair", so +1

nschneid commented 9 months ago

I think parataxis - the use of a PRP in "a number of criteria, not all of them as "cast in steel" as the points: same" seems telling, since English generally avoids subj/obj resumptives in acls.

I guess it depends which acls we're talking about...a lot of them are noun complements, for example, but I don't see that as particularly relevant to this construction.

Looking at the verbless clauses (GUM, EWT), it looks like GUM puts several verbless clauses with "each"-subjects under acl (which seems like the same general construction minus the explicit partitive of-phrase). E.g.

"Some of them very touristic" is the only parataxis one of this kind in GUM. The other main verbless acl construction in both treebanks is the with-absolute construction (#240), which seems like a cousin of the one we're deciding.

Overall, among the verbless clauses, the ones under acl are more closely integrated with the head nominal than the parataxis ones. I'm not sure we want to make an exception to that based on pronouns.

amir-zeldes commented 9 months ago

Agreed, it should be uniform, and my tendency looking at the examples is to change acl to parataxis as parenthetical, except maybe the ones introduced by "with", since I think that's a kind of complementizer that suggests a real adjunct structure (prepositional, just mark because its arg is a clause).

Would you be on board with acl for "with X Y" and parataxis otherwise? It also gels with the advcl reading of "sit with your legs crossed" in #240 .

nschneid commented 9 months ago

OK. Now all verbless acls with subjects are with-absolutes in EWT.

nschneid commented 9 months ago

Actually, I just realized that this construction has a relative clause counterpart (GUM, EWT).

Taking the above example:

We can rephrase as:

I think the parallelism is a good reason to go with acl for the first one over parataxis.

(I'd say that parentheticals are not completely covered by parataxis. Really, relative clauses and appositives can be parentheticals. So parataxis serves as a fallback for parentheticals not covered by another relation.)

amir-zeldes commented 9 months ago

I'm not sure about that... I mean, sure, a relative paraphrase for that exists, but that's probably true of a lot of parataxes. What exactly is the criterion to make this one be acl? What I like about singling out "with" is that the criterion is fairly clear: it's an explicit complementizer, which realizes explicit syntactic subordination. I can't say that about things like:

There is a paraphrase:

And the latter is definitely a subordinating construction. But in the former example "(he's my cousin)" is what I'd call a paradigm example of a parenthetical, which should therefore be parataxis.

nschneid commented 9 months ago

"He's my cousin" can stand as an independent clause, so that's clearly parataxis. It seems to me that the verbless construction we're talking about is used more narrowly to expand on a nominal.

nschneid commented 9 months ago

Relatedly, verbless absolute constructions can occur in a clause with or without "with": "(With) his hands in the air, the suspect was arrested." Those are both advcl right? I think this shows that subject+verbless predicate constructions tend to be modifiers as opposed to parataxis.

amir-zeldes commented 9 months ago

"He's my cousin" can stand as an independent clause

What about:

I think that can't stand alone - is that not a parenthetical parataxis?

nschneid commented 9 months ago

I would say "Bob (my boss)" is appos. Is it appos with "formerly"? I guess it depends how strictly the reversibility criterion is enforced. If it's not appos then it's parataxis by default—not a clause right?

This seems related to the postmodifying age construction: "Smith, 42, was appointed". EWT, GUM are not consistent.

nschneid commented 9 months ago

To expand on the problem, below we have a range of paraphrases that tack on an elaboration about the applicants:

  1. Each year the program admits a dozen applicants, mostly international students.
  2. Each year the program admits a dozen applicants, most of them international students.
  3. Each year the program admits a dozen applicants, most of which are international students. - clearly a relative clause
  4. Each year the program admits a dozen applicants, most of them coming from outside the U.S.
  5. Each year the program admits a dozen applicants, mostly coming from outside the U.S.

How should lines be drawn here? It's not obvious to me.

For elaborating clauses, @amir-zeldes suggests explicit subordination (with a word like "with" or "if" or a relative clause) as a test for modification status (acl or advcl), with parataxis coming into play otherwise.

It seems to me that there is a construction—call it the Partitive Subject Elaboration construction—that (2-4) share in common. Maybe this is reason enough to consider them all some version of acl, or maybe the Partitive Subject Elaboration construction is orthogonal to the modification-vs-parataxis distinction in UD. Interestingly, a partitive subject depictive variant of (4) with an explicit verb could occur sentence-initially ("Most of the applicants coming from outside the U.S., a dozen are admitted each year") but I don't think that works with the verbless clause in (2) ("*Most of the applicants international students, a dozen are admitted each year"), and it certainly doesn't work with the relative clause in (3).

amir-zeldes commented 8 months ago

It seems to me that there is a construction—call it the Partitive Subject Elaboration construction—that (2-4) share in common.

1. is an apposition, and 5. is an adnominal gerund clause (acl) so I agree those are separate. I can see 2 and 4 as being the same, whatever we call them, but 3. is a relative clause, which is a different, well understood construction, and uniquely acl:relcl.

nschneid commented 8 months ago
  1. is an apposition

Wouldn't (1) be like "formerly my boss"? I thought you said no to appos for that because of the adverb?

To add another one to the mix:

6. Each year the program admits a dozen applicants, mostly from outside the U.S.

amir-zeldes commented 8 months ago

Oh I see... yes, OK, maybe that's also not revsersible, I wasn't sure. In any case, I think 3. is totally different, and 2. is probably a verbless version of 4., so I agree they should ideally be the same.

nschneid commented 8 months ago

Oh I see... yes, OK, maybe that's also not revsersible, I wasn't sure.

OK to be fair this is an acceptable sentence:

But the meaning changes: I would take this to mean that all of the dozen are international, and "mostly" is in reference to some understood larger superset.