UniversalDependencies / docs

Universal Dependencies online documentation
http://universaldependencies.org/
Apache License 2.0
271 stars 245 forks source link

Right-to-left apposition with pronoun due to fronting? #751

Closed nschneid closed 1 year ago

nschneid commented 3 years ago

Currently the validator enforces a restriction that appos must go from left to right, with the justification that the apposition construction is about elaboration, hence the elaborating information must go second in the sentence (#510).

But occasionally there is an example like the following (from English-EWT):

This looks to me like a fronted appositive. It means the same thing as "He, the former leader..., was also a spiritual leader of Al Qaeda", and it is odd to say that "he" is an elaboration of "the former leader". "He" functions to connect the subject of the sentence with old information (a previous mention) and "the former leader..." is the new information.

Should this be considered a legitimate case of right-to-left-apposition, i.e. appos(he, leader-1)?

amir-zeldes commented 3 years ago

I don't think it's the same construction as an apposition. At least for me, I feel like there is an added predication here, i.e. that this is an elliptical version of:

"Being the former leader ..., he was also..."

So I would choose advcl here, so advcl(leader2,leader1). I wouldn't want to allow RTL apposition because:

We have a couple of these in GUM as well, and have gone with this advcl analysis in the past.

nschneid commented 3 years ago

Hmm, it doesn't feel to me like advcl because it applies specifically to "he".

A depictive adjective is the closest non-appositive construction I can think of: "Eager to leave, he forgot his umbrella". That would be acl, not advcl. But it feels very wrong to have a nominal expression attach as acl. And I think it's reversible like other uses of appos, it's just clear that the first part is the new information rather than the elaboration.

nschneid commented 3 years ago

What about dislocated?

sebschu commented 3 years ago

+1 for dislocated which is also used for attaching general in this example:

Bush, in answering the question about the leader of Pakistan, also said: "The new Pakistani general, he's just been elected -- not elected, this guy took over office.

But then it should also be attached to the root of the main clause, i.e., dislocated(leader2, leader1).

nschneid commented 3 years ago

There's a clear stylistic difference: the Pakistani general one sounds like the conversational/spoken language introduction of a topic before starting a complete sentence, whereas the Egyptian leader one is leading with a new descriptor of the previously mentioned entity, more like an appositive or predicative NP.

When the subject of the sentence has the same referent as the topic it may be hard to distinguish these. But consider

That's clearly the conversational-topic one. We presume the bicycle wasn't already topical in the discourse (if it were the speaker would not have needed the preface). But it was probably mentioned earlier, hence the definite reference.

So at some level these feel like different constructions. But using dislocated for both may be the best option.

sebschu commented 3 years ago

Oh, I hadn't realized that this was an example of this newspaper practice of introducing new information about an entity by changing the referring expression before I looked at the context.

But I think syntactically, it is still a topic phrase (because of the additional he as the subject), so I think dislocated is appropriate here.

amir-zeldes commented 3 years ago

I also think they're syntactically different, and notice the dislocated Pakistani General can't be made indefinite; if you do it, it becomes the 'added information' construction. Compare:

But:

amir-zeldes commented 3 years ago

Oh, I hadn't realized that this was an example of this newspaper practice of introducing new information about an entity by changing the referring expression before I looked at the context.

But I think syntactically, it is still a topic phrase (because of the additional he as the subject), so I think dislocated is appropriate here.

The reason I prefer advcl to dislocated here is that in the latter, the hanging topic should serve the same function as the second mention, usually a pronoun, so things like:

In both these cases, the dislocation matches the pronoun in definiteness, and doesn't add any adverbial flavor (e.g. causal). This is not true of the 'added information' construction, which can be interpreted e.g. as cause, and does not need to match in definiteness:

This seems very close to an elliptical version of:

nschneid commented 3 years ago

But it's also true of the standard appositive, right?

I take it those would be appos. "Being" could be inserted to make it advcl.

amir-zeldes commented 3 years ago

@nschneid yes, I agree appos also fulfills that, so I think it's better than dislocated. It seems the main reason I picked advcl is just an intuition that it feels different from appos, but that's not a serious argument. I guess the only way to really prove a difference between this construction and normal apposition is if it is separable, so if you could say:

If this is grammatical, I would say it's not appos, since appos should not be separable by an argument (but for advcl this would be fine)

amir-zeldes commented 3 years ago

OK, I looked around a little and it may be possible to find non-adjacent cases. For example I think this one can't be appos:

I think in a case like this, appos doesn't work due to non-adjacency, but a reading of "ever (being) the optimist" allows a fairly standard use of advcl. It also helps explain what an adverb like 'ever' is doing there. But for adjacent cases, maybe appos is still simpler?

sylvainkahane commented 3 years ago

It must be noted that it is a construction specific to English, or at least with specific constraints in each language. For instance the first example of @nschneid could not be translated in French with a definite article:

The former leader of the Egyptian Al-Gamaa al-Islamiya ("Islamic Group"), he was also a spiritual leader of Al Qaeda.
*L'ancien chef du groupe égyptien Al-Gamaa al-Islamiya ("groupe islamique"), il était également un chef spirituel d'Al-Qaida.
Ancien chef du groupe égyptien Al-Gamaa al-Islamiya ("groupe islamique"), il était également un chef spirituel d'Al-Qaida.

And these constraints are not the constraints we observe neither for apposition, nor for dislocation.

nschneid commented 3 years ago
  • “Maybe she really does just need a little space ...,” Amy said, ever the optimist. (The Body in the Casket: A Faith Fairchild Mystery/Katherine Hall Page, 2017)

I think in a case like this, appos doesn't work due to non-adjacency, but a reading of "ever (being) the optimist" allows a fairly standard use of advcl. It also helps explain what an adverb like 'ever' is doing there.

I suspect many appositives/parentheticals bear resemblance to predicates and can be modified by adverbs:

(a) John, (recently) a widower, said... (b) (Recently) a widower, John said... [clause-initial preferatory construction as in the Pakistani general example] (c) I told John, (recently) a widower. (d) I told a widower, John. (different pragmatics? cf. I told a widower, namely John.) (e) *I told [recently a widower], John.

My guess is that annotators would be very tempted to call all of these appos. @amir-zeldes, do you want adverb-insertability to be a test for advcl?

amir-zeldes commented 3 years ago

I see your point, I guess the adverb is not really a good argument then. I do agree with the annotators as you said though, I think appos is probably the best we can do here. The "ever the optimist" example was mainly meant to show that some cases are separable, and for those cases I think advcl (assuming an elliptical predicating "being") is the most elegant thing we can do with the current set of labels.

Since I don't think it's worth adding a label for these very rare cases, I would prefer appos for adjacent cases not involving a pronoun (which is usually not reorderable), dislocated for the lex NP in pronominal ones, and advcl for non-adjacent:

The pronominal cases with prefixed lexical subject are BTW very common in some languages (Coptic) and obligatory in some tenses in some languages, e.g. Hausa, where we must specify a pronominal person-aspect complex, even if a lex NP subject is also specified.

nschneid commented 3 years ago
  • Amy said, ever the optimist/advcl, that ...

That's ungrammatical for me, or at least exceedingly odd. I think if "ever the optimist" comes after "said" it has to be at the end of the sentence.

For

I can see advcl as a hacky workaround to compensate for the fact that English appositions don't actually respect the contiguity constraint 100% of the time.

Personally, I'd rather change the contiguity constraint to a preference instead of doing gymnastics to work around it.

Maybe @dan-zeman wants to weigh in?

nschneid commented 3 years ago

For completeness, we should also consider whether "ever the optimist" should be dislocated on the right, if we treat the Pakistani general example as dislocation on the left.

I will note that Wikipedia classifies these as appositions; these are NOT the same as left dislocation and right dislocation as defined here, which are like "My neighbor, he's great" discussed above—"my neighbor" is not adding information about "he", but rather, bringing up a discourse-new entity that can then be referred to with "he".

It occurs to me that the contraction with the copula shows that "he" is directly the subject, not a modifier of "neighbor": you wouldn't say "Amy, ever the optimist's, hopeful for the future"—to use the contracted copula on the subject you'd have to say "Ever the optimist, Amy's hopeful for the future".

dan-zeman commented 3 years ago

"Blah blah blah," Amy said, ever the optimist.

To me, advcl looks much more natural here than appos. That is, advcl(said, optimist) instead of appos(Amy, optimist)

What Wikipedia says about apposition is probably irrelevant because if anything is clear, then it is the fact that various authors have much broader notion of apposition than the one used in UD :-)

nschneid commented 3 years ago

@dan-zeman In that case would you say "Ever the optimist, Amy said...." should be advcl, dislocated, or appos?

dan-zeman commented 3 years ago

It would be the same for me, i.e., advcl.

However, it now occurs to me that it also resembles secondary predication (optional depictive), in which case the UD guidelines say it is acl(Amy, optimist). Even here, I don't think the word order matters.

nschneid commented 3 years ago

Ah yes, it does resemble a depictive. I'm starting to wonder whether the notion of apposition is somewhat orthogonal to the kinds of relationships expressed by the core UD relations (nmod, acl, advcl, conj, etc.). Canonical cases of apposition seem to involve (1) elaboration of a nominal with a modifier that is (2) parenthetical and (3) nominal. (3) is in common with depictive acl. Maybe we need a definition for "parenthetical" to separate that from the criteria for apposition. Is it relevant that two of these seems odd:

I.e. maybe one interpretation of comma-separated "ever the optimist" is parenthetical and another isn't? I admit I don't know much about the syntax/information structure of parentheticals. :)

BTW apparently nonrestrictive relative clauses in English are also called "appositive relatives". This matches the parenthetical aspect (2) but not (3).

nschneid commented 3 years ago

Come to think of it: do any languages code morphologically for appositives? If not I can't help but wonder if something like nmod:pred (predicative nominal modifier) wouldn't be a better solution, also for nominal depictives. (And then amod:pred for adjectival depictives. The use of acl for depictives has always seemed stained to me.)

dan-zeman commented 3 years ago

I guess the idea behind acl is that secondary predication, as any other predication, means a clause.

nschneid commented 3 years ago

OK I finally looked up what CGEL has to say about appositives, and unfortunately, it is not simple.

First, the general definition of appositive with examples of "integrated dependents in NP structure" such as the opera 'Carmen' (p. 447): image

They use the term supplementation for a parenthetical-like interruption, which includes but is not limited to NPs and appositives. Starting on p. 1356:

image image image

nschneid commented 3 years ago

Summary of the CGEL taxonomy

Supplement Non-supplement
Appositive The first contestant, Lulu, was ushered on stage.
A university lecturer, Dr. Brown, was arrested.
I met a friend of yours at the party last night - Emma Carlisle.
the opera Carmen
Non-appositive [ascriptive NP after anchor] Her father, a die-hard conservative, refused.
[ascriptive NP before anchor] A die-hard conservative, her father refused.
[ascriptive RC, verbal anchor] We stopped on the way, which made us late.
[ascriptive NP, verbal anchor] United will be playing at home, a not inconsiderable advantage.

Above I have bolded the appositive or supplement and italicized the head of the NP it refers to.

How does this relate to current criteria for appos?

It seems we are using appos for adjacent, postnominal NP supplements, whether they are properly appositives in CGEL terminology or not. The difficulties raised by this thread involve prenominal or non-adjacent NP supplements.

It is also not entirely clear where CGEL's non-supplement appositives fall under current guidelines, as discussed in #591 for example.

dan-zeman commented 3 years ago

It is also not entirely clear where CGEL's non-supplement appositives fall under current guidelines

For me, these were always doubtlessly nmod. But I have observed that some people expect prepositions or genitives when nmod is involved.

amir-zeldes commented 3 years ago

Thanks @nschneid for putting together this overview!

I just want to say two things:

Stormur commented 3 years ago

It would be the same for me, i.e., advcl.

However, it now occurs to me that it also resembles secondary predication (optional depictive), in which case the UD guidelines say it is acl(Amy, optimist). Even here, I don't think the word order matters.

In #476 it was already argued to change the annotation of "depictives" from acl to advcl, and all this discussion seems to bring new evidence! Namely, the main fact is that, even though it might not be obvious in English, where the subject is expressed obligatorily, such depictives, or secondary predications, appear independently from it: for example, as noted, they can be separated by what they refer to. So acl is not tenable, as from what I understand it requires an explicit element, else it depends on the predicate and it becomes advcl anyway. In Latin we are planning to use advcl:pred to point it out from other adverbial clauses (they use markedly different strategies, mostly coming from the fact that secondary predications seem to refer only to core arguments).

Besides, I am in agreement with the interesting differences between appos, dislocated and advcl highlighted in this discussion. Very interesting points have been made.