propbank / propbank-release

The official released annotations, both in .prop pointer format and as conll files. Does not contain the source texts
Creative Commons Attribution Share Alike 4.0 International
135 stars 12 forks source link

SRL on top of UD #17

Open arademaker opened 2 years ago

arademaker commented 2 years ago

74 How much is its accuracy , I think , is still quite hard to determine at this time , indeed ,

─┮  
 │     ╭─╼ How ADV advmod 1 2  
 │   ╭─┶ much ADJ dep 2 3  
 │ ╭─┾ is VERB ccomp 3 13              <= ARG1-DSP
 │ │ │ ╭─╼ its PRON nmod:poss 4 5  
 │ │ ╰─┶ accuracy NOUN nsubj 5 3  
 │ ├─╼ , PUNCT punct 6 13  
 │ │ ╭─╼ I PRON nsubj 7 8  
 │ ├─┶ think VERB parataxis 8 13       <= PRED think.01
 │ ├─╼ , PUNCT punct 9 13  
 │ ├─╼ is AUX cop 10 13  
 │ ├─╼ still ADV advmod 11 13  
 │ ├─╼ quite ADV advmod 12 13  
 ╰─┾ hard ADJ root 13 0                <= ARG1-DSP
   │ ╭─╼ to PART mark 14 15  
   ├─┾ determine VERB xcomp 15 13  
   │ │ ╭─╼ at ADP case 16 18  
   │ │ ├─╼ this DET det 17 18  
   │ ├─┶ time NOUN obl 18 15  
   │ ├─╼ , PUNCT punct 19 15  
   │ ╰─╼ indeed ADV advmod 20 15  
   ╰─╼ , PUNCT punct 21 13  

The cases of ARG?-DSP, when projected to dependencies may not be trivial.

Does the annotation above make sense? @MarthaSPalmer? Considering that token 3 is argument of token 13, can we only annotate token 13? The DSP would only mark that the argument is actually an ancestor of the token 8 (the predicate).

@nschneid, what about the syntactic annotation?

arademaker commented 2 years ago

Yet another case to encourage some thoughts. Basically, here, the 'that' was annotated as WHADVP in the constituent analysis and in UD it is SCONJ/mark introducing the subordinate clause.

13> It was during this year that the Japanese army developed a strategy to rapidly force the Chinese people into submission by the end of 1940 .

─┮  
 │ ╭─╼ It PRON nsubj 1 5  
 │ ├─╼ was AUX cop 2 5  
 │ ├─╼ during ADP case 3 5  
 │ ├─╼ this DET det 4 5  
 ╰─┾ year NOUN root 5 0              <= ARGM-TMP
   │ ╭─╼ that ADP mark 6 10  
   │ │ ╭─╼ the DET det 7 9  
   │ │ ├─╼ Japanese ADJ amod 8 9  
   │ ├─┶ army NOUN nsubj 9 10  
   ├─┾ developed VERB acl 10 5       <= develop.02
   │ │ ╭─╼ a DET det 11 12  
   │ ╰─┾ strategy NOUN obj 12 10  
   │   │ ╭─╼ to PART mark 13 15  
   │   │ ├─╼ rapidly ADV advmod 14 15  
   │   ╰─┾ force VERB acl 15 12  
   │     │ ╭─╼ the DET det 16 18  
   │     │ ├─╼ Chinese ADJ amod 17 18  
   │     ├─┶ people NOUN obj 18 15  
   │     │ ╭─╼ into ADP case 19 20  
   │     ├─┶ submission NOUN obl 20 15  
   │     │ ╭─╼ by ADP case 21 23  
   │     │ ├─╼ the DET det 22 23  
   │     ╰─┾ end NOUN obl 23 15  
   │       │ ╭─╼ of ADP case 24 25  
   │       ╰─┶ 1940 NUM nmod 25 23  
   ╰─╼ . PUNCT punct 26 5  

Previously

Tree:
-----
    (TOP (S-CLF (NP-SBJ (PRP It))
                (VP (VBD was)
                    (PP-TMP-PRD (IN during)
                                (NP (DT this)
                                    (NN year)))
                    (SBAR (WHADVP-2 (IN that))
                          (S (NP-SBJ (DT the)
                                     (JJ Japanese)
                                     (NN army))
                             (VP (VBD developed)
                                 (NP (NP (DT a)
                                         (NN strategy))
                                     (SBAR (WHNP-1 (-NONE- 0))
                                           (S (NP-SBJ (-NONE- *T*-1))
                                              (VP (TO to)
                                                  (ADVP-MNR (RB rapidly))
                                                  (VP (VB force)
                                                      (NP (DT the)
                                                          (JJ Chinese)
                                                          (NNS people))
                                                      (PP-CLR (IN into)
                                                              (NP (NN submission)))
                                                      (PP-TMP (IN by)
                                                              (NP (NP (DT the)
                                                                      (NN end))
                                                                  (PP (IN of)
                                                                      (NP (CD 1940))))))))))
                                 (ADVP-TMP (-NONE- *T*-2))))))
                (. .)))

Leaves:
-------
    0   It
    1   was
           sense: be-v.1
           prop:  be.01
            v          * -> 1:0,  was
            ARG2       * -> 2:1,  during this year
            ARG1       * -> 5:2,  that the Japanese army developed a strategy 0 *T*-1 to rapidly force the Chinese
                                  people into submission by the end of 1940 *T*-2
    2   during
    3   this
           coref: IDENT        247   3-4    this year
           name:  DATE               3-4    this year
    4   year
    5   that
    6   the
           coref: IDENT        211   6-8    the Japanese army
    7   Japanese
           name:  NORP               7-7    Japanese
    8   army
    9   developed
           sense: develop-v.1
           prop:  develop.02
            v          * -> 9:0,  developed
            ARG0       * -> 6:1,  the Japanese army
            ARG1       * -> 10:2, a strategy 0 *T*-1 to rapidly force the Chinese people into submission by the end of
                                  1940
            ARGM-TMP   * -> 27:0, *T*-2 -> 5:1, that
            LINK-SLC   * -> 27:0, *T*-2 -> 5:1, that
                       * -> 2:1,  during this year
nschneid commented 2 years ago

74 How much is its accuracy , I think , is still quite hard to determine at this time , indeed ,

@nschneid, what about the syntactic annotation?

Well, the sentence isn't 100% grammatical.

This sentence exemplifies a tough-construction: [How much is its accuracy] + hard + [to determine]. I'm not entirely sure how to analyze it, but I suspect it should be csubj instead of ccomp for the attachment of the first clause. There is a long unresolved discussion at UniversalDependencies/docs#308.

MarthaSPalmer commented 2 years ago

This is way above my pay grade.

Katie, haven’t you done a lot of PB annotation on dependency parses? How would you have annotated the sentence below?

Nathan may well be right that it isn’t grammatical or the right parse, but PropBank would still try to annotate what is there.

Martha

Begin forwarded message:

From: Nathan Schneider @.**@.>> Subject: Re: [propbank/propbank-release] SRL on top of UD (Issue #17) Date: November 26, 2021 at 11:42:52 PM MST To: propbank/propbank-release @.**@.>> Cc: Martha Palmer @.**@.>>, Mention @.**@.>> Reply-To: propbank/propbank-release @.**@.>>

74 How much is its accuracy , I think , is still quite hard to determine at this time , indeed ,

@nschneidhttps://github.com/nschneid, what about the syntactic annotation?

Well, the sentence isn't 100% grammatical.

This sentence exemplifies a tough-construction: [How much is its accuracy] + hard + [to determine]. I'm not entirely sure how to analyze it, but I suspect it should be csubj instead of ccomp for the attachment of the first clause. There is a long unresolved discussion at UniversalDependencies/docs#308https://github.com/UniversalDependencies/docs/issues/308.

On Nov 24, 2021, at 1:25 PM, Alexandre Rademaker @.**@.>> wrote:

74 How much is its accuracy , I think , is still quite hard to determine at this time , indeed ,

─┮ │ ╭─╼ How ADV advmod 1 2 │ ╭─┶ much ADJ dep 2 3 │ ╭─┾ is VERB ccomp 3 13 <= ARG1-DSP │ │ │ ╭─╼ its PRON nmod:poss 4 5 │ │ ╰─┶ accuracy NOUN nsubj 5 3 │ ├─╼ , PUNCT punct 6 13 │ │ ╭─╼ I PRON nsubj 7 8 │ ├─┶ think VERB parataxis 8 13 <= PRED think.01 │ ├─╼ , PUNCT punct 9 13 │ ├─╼ is AUX cop 10 13 │ ├─╼ still ADV advmod 11 13 │ ├─╼ quite ADV advmod 12 13 ╰─┾ hard ADJ root 13 0 <= ARG1-DSP │ ╭─╼ to PART mark 14 15 ├─┾ determine VERB xcomp 15 13 │ │ ╭─╼ at ADP case 16 18 │ │ ├─╼ this DET det 17 18 │ ├─┶ time NOUN obl 18 15 │ ├─╼ , PUNCT punct 19 15 │ ╰─╼ indeed ADV advmod 20 15 ╰─╼ , PUNCT punct 21 13

The cases of ARG?-DSPhttps://github.com/propbank/propbank-release/blob/master/docs/conll-conversion-notes.md#argx-dsp-arguments, when projected to dependencies may not be trivial.

Does the annotation above make sense? @MarthaSPalmerhttps://github.com/MarthaSPalmer? Considering that token 3 is argument of token 13, can we only annotate token 13? The DSP would only mark that the argument is actually an ancestor of the token 8 (the predicate).

@nschneidhttps://github.com/nschneid, what about the syntactic annotation?

— You are receiving this because you were mentioned. Reply to this email directly, view it on GitHubhttps://github.com/propbank/propbank-release/issues/17, or unsubscribehttps://github.com/notifications/unsubscribe-auth/ABB327USNK7XHCLNG4ZXVNDUNVC27ANCNFSM5IW56WYQ. Triage notifications on the go with GitHub Mobile for iOShttps://apps.apple.com/app/apple-store/id1477376905?ct=notification-email&mt=8&pt=524675 or Androidhttps://play.google.com/store/apps/details?id=com.github.android&referrer=utm_campaign%3Dnotification-email%26utm_medium%3Demail%26utm_source%3Dgithub.