UniversalDependencies / UD_English-EWT

English data
Creative Commons Attribution Share Alike 4.0 International

Tricky EUD cases #521

Closed xiulinyang closed 6 months ago

xiulinyang commented 6 months ago

I don't know how to annotate the enhanced dependencies (EUD) for the relative clauses in the following sentences.
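
For readers following along: in CoNLL-U the enhanced graph lives in the ninth column (DEPS) as `head:relation` pairs separated by `|`, while the basic tree uses columns seven and eight (HEAD, DEPREL). The sketch below is only an illustration, not part of the treebank tooling. It lists enhanced edges whose head differs from the basic head, which is where the relative-clause edges in question show up. The function name `extra_enhanced_edges` is hypothetical, and the code assumes tab-separated input as in the treebank files (the listings in this thread have had their tabs flattened to spaces).

```python
def extra_enhanced_edges(conllu_sentence: str):
    """Yield (id, form, enhanced_head, enhanced_rel) for every enhanced edge
    whose head differs from the token's basic HEAD."""
    for line in conllu_sentence.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (e.g. 20-21) and empty nodes
        tok_id, form, head, deps = cols[0], cols[1], cols[6], cols[8]
        if deps == "_":
            continue
        for pair in deps.split("|"):
            # The head is everything before the first colon; the relation may
            # itself contain colons (e.g. conj:and).
            ehead, _, erel = pair.partition(":")
            if ehead != head:
                yield tok_id, form, ehead, erel


# Tokens 10 and 13 of the first train sentence, rebuilt with real tabs.
demo = "\n".join([
    "\t".join(["10", "I", "I", "PRON", "PRP",
               "Case=Nom|Number=Sing|Person=1|PronType=Prs",
               "11", "nsubj", "11:nsubj|13:nsubj", "_"]),
    "\t".join(["13", "left", "leave", "VERB", "VBD",
               "Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin",
               "11", "conj", "8:advcl|11:conj:and", "_"]),
])
print(list(extra_enhanced_edges(demo)))
```

Edges that keep the basic head and only refine the label (such as `11:conj:and`) are deliberately not reported, so the output is limited to the propagated and relative-clause edges.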

train split

# sent_id = answers-20111108091921AAaLK4e_ans-0017
# text = Anyway, so that was 2 days ago that I called and left a message, but he still hasn't called back yet.
1   Anyway  anyway  INTJ    UH  _   8   discourse   8:discourse SpaceAfter=No
2   ,   ,   PUNCT   ,   _   1   punct   1:punct _
3   so  so  ADV RB  _   8   advmod  8:advmod    _
4   that    that    PRON    DT  Number=Sing|PronType=Dem    8   expl    8:expl  _
5   was be  AUX VBD Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin   8   cop 8:cop   _
6   2   2   NUM CD  NumForm=Digit|NumType=Card  7   nummod  7:nummod    _
7   days    day NOUN    NNS Number=Plur 8   obl:npmod   8:obl:npmod _
8   ago ago ADV RB  _   0   root    0:root  _
9   that    that    SCONJ   IN  _   11  mark    11:mark _
10  I   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  11  nsubj   11:nsubj|13:nsubj   _
11  called  call    VERB    VBD Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin   8   advcl:relcl 8:advcl:relcl   Cxn=rc-red-missingedep
12  and and CCONJ   CC  _   13  cc  13:cc   _
**13    left    leave   VERB    VBD Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin   11  conj    8:advcl|11:conj:and _**
14  a   a   DET DT  Definite=Ind|PronType=Art   15  det 15:det  _
15  message message NOUN    NN  Number=Sing 13  obj 13:obj  SpaceAfter=No
16  ,   ,   PUNCT   ,   _   22  punct   22:punct    _
17  but but CCONJ   CC  _   22  cc  22:cc   _
18  he  he  PRON    PRP Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs  22  nsubj   22:nsubj    _
19  still   still   ADV RB  _   22  advmod  22:advmod   _
20-21   hasn't  _   _   _   _   _   _   _   _
20  has have    AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   22  aux 22:aux  _
21  n't not PART    RB  _   22  advmod  22:advmod   _
22  called  call    VERB    VBN Tense=Past|VerbForm=Part    8   conj    8:conj:but  _
23  back    back    ADP RP  _   22  compound:prt    22:compound:prt _
24  yet yet ADV RB  _   22  advmod  22:advmod   SpaceAfter=No
25  .   .   PUNCT   .   _   8   punct   8:punct _

# sent_id = reviews-188548-0004
# text = When it came time to pay the bill up front, they would not let me use any of the certificate for a tip (which I have done with any other restaurant I've gotten a gift certificate for.)
1   When    when    ADV WRB PronType=Int    3   advmod  3:advmod    _
2   it  it  PRON    PRP Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs  3   expl    3:expl  _
3   came    come    VERB    VBD Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin   15  advcl   15:advcl:when   _
4   time    time    NOUN    NN  Number=Sing 3   obj 3:obj   _
5   to  to  PART    TO  _   6   mark    6:mark  _
6   pay pay VERB    VB  VerbForm=Inf    4   acl 4:acl:to    _
7   the the DET DT  Definite=Def|PronType=Art   8   det 8:det   _
8   bill    bill    NOUN    NN  Number=Sing 6   obj 6:obj   _
9   up  up  ADV RB  _   10  advmod  10:advmod   _
10  front   front   ADV RB  _   6   advmod  6:advmod    SpaceAfter=No
11  ,   ,   PUNCT   ,   _   3   punct   3:punct _
12  they    they    PRON    PRP Case=Nom|Number=Plur|Person=3|PronType=Prs  15  nsubj   15:nsubj    _
13  would   would   AUX MD  VerbForm=Fin    15  aux 15:aux  _
14  not not PART    RB  _   15  advmod  15:advmod   _
15  let let VERB    VB  VerbForm=Inf    0   root    0:root  _
16  me  I   PRON    PRP Case=Acc|Number=Sing|Person=1|PronType=Prs  15  obj 15:obj|17:nsubj:xsubj   _
17  use use VERB    VB  VerbForm=Inf    15  xcomp   15:xcomp    _
18  any any DET DT  PronType=Ind    17  obj 17:obj  _
19  of  of  ADP IN  _   21  case    21:case _
20  the the DET DT  Definite=Def|PronType=Art   21  det 21:det  _
21  certificate certificate NOUN    NN  Number=Sing 18  nmod    18:nmod:of  _
22  for for ADP IN  _   24  case    24:case _
23  a   a   DET DT  Definite=Ind|PronType=Art   24  det 24:det  _
24  tip tip NOUN    NN  Number=Sing 17  obl 17:obl:for  _
25  (   (   PUNCT   -LRB-   _   29  punct   29:punct    SpaceAfter=No
26  which   which   PRON    WDT PronType=Rel    29  obj 29:obj  _
27  I   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  29  nsubj   29:nsubj    _
28  have    have    AUX VBP Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin   29  aux 29:aux  _
29  done    do  VERB    VBN Tense=Past|VerbForm=Part    17  advcl:relcl 17:advcl:relcl  Cxn=rc-red-missingedep-pstrand
30  with    with    ADP IN  _   33  case    33:case _
31  any any DET DT  PronType=Ind    33  det 33:det  _
32  other   other   ADJ JJ  Degree=Pos  33  amod    33:amod _
33  restaurant  restaurant  NOUN    NN  Number=Sing 29  obl 29:obl:with|36:obl  _
34-35   I've    _   _   _   _   _   _   _   _
34  I   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  36  nsubj   36:nsubj    _
35  've have    AUX VBP Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin   36  aux 36:aux  _
36  gotten  get VERB    VBN Tense=Past|VerbForm=Part    33  acl:relcl   33:acl:relcl    Cxn=rc-red-obl-pstrand
37  a   a   DET DT  Definite=Ind|PronType=Art   39  det 39:det  _
38  gift    gift    NOUN    NN  Number=Sing 39  compound    39:compound _
39  certificate certificate NOUN    NN  Number=Sing 36  obj 36:obj  _
40  for for ADP IN  _   39  nmod    39:nmod Promoted=Yes|SpaceAfter=No
41  .   .   PUNCT   .   _   29  punct   29:punct    SpaceAfter=No
42  )   )   PUNCT   -RRB-   _   29  punct   29:punct    _

dev split

# sent_id = answers-20111108102900AA9qsc8_ans-0004
# newpar id = answers-20111108102900AA9qsc8_ans-p0004
# text = its your cat you can pick and name you want
1-2 its _   _   _   _   _   _   _   _
1   it  it  PRON    PRP Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs  4   nsubj   4:nsubj _
2   s   be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|Typo=Yes|VerbForm=Fin  4   cop 4:cop   CorrectForm='s
3   your    your    PRON    PRP$    Case=Gen|Person=2|Poss=Yes|PronType=Prs 4   nmod:poss   4:nmod:poss _
4   cat cat NOUN    NN  Number=Sing 0   root    0:root  _
5   you you PRON    PRP Case=Nom|Person=2|PronType=Prs  7   nsubj   7:nsubj _
6   can can AUX MD  VerbForm=Fin    7   aux 7:aux   _
7   pick    pick    VERB    VB  VerbForm=Inf    4   parataxis   4:parataxis _
8   and a   DET DT  Definite=Ind|PronType=Art|Typo=Yes  9   det 9:det   CorrectForm=a
9   name    name    NOUN    NN  Number=Sing 7   obj 7:obj|11:obj    _
10  you you PRON    PRP Case=Nom|Person=2|PronType=Prs  11  nsubj   11:nsubj    _
11  want    want    VERB    VBP Mood=Ind|Number=Sing|Person=2|Tense=Pres|VerbForm=Fin   9   acl:relcl   9:acl:relcl Cxn=rc-red-obj

# sent_id = weblog-blogspot.com_aggressivevoicedaily_20060814163400_ENG_20060814_163400-0007
# text = "They can freely write anything they like about our prophet, but if one raises doubts about the Holocaust he is either fined or sent to prison," he added.
1   "   "   PUNCT   ``  _   5   punct   5:punct SpaceAfter=No
2   They    they    PRON    PRP Case=Nom|Number=Plur|Person=3|PronType=Prs  5   nsubj   5:nsubj _
3   can can AUX MD  VerbForm=Fin    5   aux 5:aux   _
4   freely  freely  ADV RB  _   5   advmod  5:advmod    _
5   write   write   VERB    VB  VerbForm=Inf    32  ccomp   32:ccomp    _
6   anything    anything    PRON    NN  Number=Sing|PronType=Ind    5   obj 5:obj|8:obj _
7   they    they    PRON    PRP Case=Nom|Number=Plur|Person=3|PronType=Prs  8   nsubj   8:nsubj _
8   like    like    VERB    VBP Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin   6   acl:relcl   6:acl:relcl Cxn=rc-red-obj
9   about   about   ADP IN  _   11  case    11:case _
10  our our PRON    PRP$    Case=Gen|Number=Plur|Person=1|Poss=Yes|PronType=Prs 11  nmod:poss   11:nmod:poss    _
11  prophet prophet NOUN    NN  Number=Sing 6   nmod    6:nmod:about    SpaceAfter=No
12  ,   ,   PUNCT   ,   _   24  punct   24:punct    _
13  but but CCONJ   CC  _   24  cc  24:cc   _
14  if  if  SCONJ   IN  _   16  mark    16:mark _
15  one one PRON    PRP Number=Sing|Person=3|PronType=Prs   16  nsubj   16:nsubj    _
16  raises  raise   VERB    VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   24  advcl   24:advcl:if _
17  doubts  doubt   NOUN    NNS Number=Plur 16  obj 16:obj  _
18  about   about   ADP IN  _   20  case    20:case _
19  the the DET DT  Definite=Def|PronType=Art   20  det 20:det  _
20  Holocaust   Holocaust   PROPN   NNP Number=Sing 17  nmod    17:nmod:about   _
21  he  he  PRON    PRP Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs  24  nsubj:pass  24:nsubj:pass|26:nsubj:pass _
22  is  be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   24  aux:pass    24:aux:pass _
23  either  either  CCONJ   CC  _   24  cc:preconj  24:cc:preconj   _
24  fined   fine    VERB    VBN Tense=Past|VerbForm=Part|Voice=Pass 5   conj    5:conj:but|32:ccomp _
25  or  or  CCONJ   CC  _   26  cc  26:cc   _
26  sent    send    VERB    VBN Tense=Past|VerbForm=Part|Voice=Pass 24  conj    24:conj:or  _
27  to  to  ADP IN  _   28  case    28:case _
28  prison  prison  NOUN    NN  Number=Sing 26  obl 26:obl:to   SpaceAfter=No
29  ,   ,   PUNCT   ,   _   5   punct   5:punct SpaceAfter=No
30  "   "   PUNCT   ''  _   5   punct   5:punct _
31  he  he  PRON    PRP Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs  32  nsubj   32:nsubj    _
32  added   add VERB    VBD Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin   0   root    0:root  SpaceAfter=No
33  .   .   PUNCT   .   _   32  punct   32:punct    _

test split

# sent_id = reviews-241108-0004
# text = Did services I asked them NOTto do and was still charged.
1   Did do  VERB    VBD Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin   0   root    0:root  _
2   services    service NOUN    NNS Number=Plur 1   obj 1:obj   _
3   I   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  4   nsubj   4:nsubj _
4   asked   ask VERB    VBD Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin   2   acl:relcl   2:acl:relcl Cxn=rc-red-missingedep
5   them    they    PRON    PRP Case=Acc|Number=Plur|Person=3|PronType=Prs  4   iobj    4:iobj|8:nsubj:xsubj    _
6   NOT not PART    RB  _   8   advmod  8:advmod    CorrectSpaceAfter=Yes|SpaceAfter=No
7   to  to  PART    TO  _   8   mark    8:mark  _
8   do  do  VERB    VB  VerbForm=Inf    4   xcomp   4:xcomp _
9   and and CCONJ   CC  _   12  cc  12:cc   _
10  was be  AUX VBD Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin   12  aux:pass    12:aux:pass _
11  still   still   ADV RB  _   12  advmod  12:advmod   _
12  charged charge  VERB    VBN Tense=Past|VerbForm=Part|Voice=Pass 1   conj    1:conj:and  SpaceAfter=No
13  .   .   PUNCT   .   _   1   punct   1:punct _

# sent_id = answers-20111108100703AAo53QA_ans-0008
# text = Universities will take you whatever age you are.
1   Universities    university  NOUN    NNS Number=Plur 3   nsubj   3:nsubj _
2   will    will    AUX MD  VerbForm=Fin    3   aux 3:aux   _
3   take    take    VERB    VB  VerbForm=Inf    0   root    0:root  _
4   you you PRON    PRP Case=Acc|Person=2|PronType=Prs  3   obj 3:obj   _
5   whatever    whatever    DET WDT PronType=Int    6   det 6:det   _
6   age age NOUN    NN  Number=Sing 3   obl:npmod   3:obl:npmod|8:obl   _
7   you you PRON    PRP Case=Nom|Person=2|PronType=Prs  8   nsubj   8:nsubj _
8   are be  AUX VBP Mood=Ind|Number=Sing|Person=2|Tense=Pres|VerbForm=Fin   6   acl:relcl   6:acl:relcl Cxn=rc-red-obl-auxstrand|Promoted=Yes|SpaceAfter=No
9   .   .   PUNCT   .   _   3   punct   3:punct _

nschneid commented 6 months ago

These are tough ones!

xiulinyang commented 6 months ago

OK, I just updated the annotation.

nschneid commented 6 months ago

Thanks! I'll fix the typo upstream.

Are all the missingedep annotations gone now?
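
One way to check this, as a minimal sketch: scan the MISC column (the tenth) for a `Cxn` value containing `missingedep` and report the sentence and token. The helper name `find_cxn` is hypothetical and not part of any existing validation script. The code assumes tab-separated CoNLL-U input and `# sent_id = ...` comment lines, as in the treebank files.

```python
def find_cxn(conllu_text: str, needle: str = "missingedep"):
    """Return (sent_id, token_id, form, cxn_value) for every token whose
    MISC column has a Cxn value containing `needle`."""
    hits = []
    sent_id = None
    for line in conllu_text.splitlines():
        if line.startswith("# sent_id = "):
            sent_id = line[len("# sent_id = "):].strip()
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # comments, blank lines, malformed rows
        # MISC is a |-separated list of Key=Value items (or "_").
        for item in cols[9].split("|"):
            key, _, value = item.partition("=")
            if key == "Cxn" and needle in value:
                hits.append((sent_id, cols[0], cols[1], value))
    return hits
```

An empty result over the train, dev, and test files would mean no `missingedep` constructions remain. Passing a different `needle` (for example `rc-wh-obl`) finds other construction labels the same way.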

xiulinyang commented 6 months ago

Yes, now they are. I found one missingedep in the train split and changed its construction to rc-wh-obl without adding the EUD.

# sent_id = reviews-319816-0016
# text = So I pointed this out to him, at which point he said they only had one of the correct tires in stock.
1   So  so  ADV RB  _   3   advmod  3:advmod    _
2   I   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  3   nsubj   3:nsubj _
3   pointed point   VERB    VBD Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Fin   0   root    0:root  _
4   this    this    PRON    DT  Number=Sing|PronType=Dem    3   obj 3:obj   _
5   out out ADP RP  _   3   compound:prt    3:compound:prt  _
6   to  to  ADP IN  _   7   case    7:case  _
7   him he  PRON    PRP Case=Acc|Gender=Masc|Number=Sing|Person=3|PronType=Prs  3   obl 3:obl:to    SpaceAfter=No
8   ,   ,   PUNCT   ,   _   13  punct   13:punct    _
9   at  at  ADP IN  _   11  case    11:case _
10  which   which   DET WDT PronType=Rel    11  det 11:det  _
11  point   point   NOUN    NN  Number=Sing 13  obl 13:obl:at   _
12  he  he  PRON    PRP Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs  13  nsubj   13:nsubj    _
13  said    say VERB    VBD Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin   3   advcl:relcl 3:advcl:relcl   Cxn=rc-wh-obl
14  they    they    PRON    PRP Case=Nom|Number=Plur|Person=3|PronType=Prs  16  nsubj   16:nsubj    _
15  only    only    ADV RB  _   16  advmod  16:advmod   _
16  had have    VERB    VBD Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin   13  ccomp   13:ccomp    _
17  one one NUM CD  NumForm=Word|NumType=Card   16  obj 16:obj  _
18  of  of  ADP IN  _   21  case    21:case _
19  the the DET DT  Definite=Def|PronType=Art   21  det 21:det  _
20  correct correct ADJ JJ  Degree=Pos  21  amod    21:amod _
21  tires   tire    NOUN    NNS Number=Plur 17  nmod    17:nmod:of  _
22  in  in  ADP IN  _   23  case    23:case _
23  stock   stock   NOUN    NN  Number=Sing 16  obl 16:obl:in   SpaceAfter=No
24  .   .   PUNCT   .   _   3   punct   3:punct _

nschneid commented 6 months ago

This is part of #392