UniversalDependencies / UD_English-GUM

Other
32 stars 4 forks source link

Incorrect annotations for some relative clauses #72

Closed xiulinyang closed 11 months ago

xiulinyang commented 11 months ago

There might be some errors in the following annotations.

The sentence This process, as you’ll recall from What is Anthropology? is called enculturation. is not a relative clause. Therefore, the edge acl:relcl for the token recall is incorrect.

# sent_id = GUM_textbook_anthropology-35
# s_prominence = 2
# s_type = decl
# transition = establishment
# text = This process, as you’ll recall from What is Anthropology? is called enculturation.
1   This    this    DET DT  Number=Sing|PronType=Dem    2   det 2:det   Discourse=elaboration-additional:83->81:3|Entity=(75-abstract-giv:act-cf1*-2-coref
2   process process NOUN    NN  Number=Sing 14  nsubj:pass  7:obl:from|14:nsubj:pass|15:nsubj:xsubj Entity=75)|SpaceAfter=No
3   ,   ,   PUNCT   ,   _   7   punct   7:punct _
4   as  as  SCONJ   IN  _   7   mark    7:mark  Discourse=explanation-evidence:84->85:0
5-6 you’ll  _   _   _   _   _   _   _   _
5   you you PRON    PRP Case=Nom|Number=Sing|Person=2|PronType=Prs  7   nsubj   7:nsubj Entity=(15-person-giv:inact-cf2-1-ana)
6   ’ll will    AUX MD  VerbForm=Fin    7   aux 7:aux   _
7   recall  recall  VERB    VB  VerbForm=Inf    2   acl:relcl   2:acl:relcl _
8   from    from    ADP IN  _   9   case    9:case  _
9   What    what    PRON    WP  PronType=Rel    7   obl 2:ref   Entity=(81-abstract-new-cf4-1-sgl|XML=<ref target:::"https://openstax.org/books/introduction-anthropology/pages/1-introduction">
10  is  be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   11  cop 11:cop  _
11  Anthropology    anthropology    NOUN    NN  Number=Sing 9   acl:relcl   9:acl:relcl Entity=(82-abstract-new-cf3-1-sgl)|SpaceAfter=No
12  ?   ?   PUNCT   .   _   2   punct   2:punct Entity=81)|XML=</ref>
13  is  be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   14  aux:pass    14:aux:pass Discourse=same-unit_m:85->83:0
14  called  call    VERB    VBN Tense=Past|VerbForm=Part|Voice=Pass 0   root    0:root  _
15  enculturation   enculturation   NOUN    NN  Number=Sing 14  xcomp   14:xcomp    Entity=(75-abstract-giv:act-cf1*-1-coref)|SpaceAfter=No
16  .   .   PUNCT   .   _   14  punct   14:punct    _

In An exchange student from China might be annoyed by the constant interruptions in class as other students ask questions—a practice that is considered rude in China., that should be pass:nsubj rather than mark.

# sent_id = GUM_textbook_sociology-29
# s_prominence = 2
# s_type = sub
# transition = null
# text = An exchange student from China might be annoyed by the constant interruptions in class as other students ask questions—a practice that is considered rude in China.
1   An  a   DET DT  Definite=Ind|PronType=Art   3   det 3:det   Discourse=joint-list_m:76->75:0|Entity=(144-person-new-cf3-3-coref
2   exchange    exchange    NOUN    NN  Number=Sing 3   compound    3:compound  _
3   student student NOUN    NN  Number=Sing 8   nsubj:pass  8:nsubj:pass    _
4   from    from    ADP IN  _   5   case    5:case  _
5   China   China   PROPN   NNP Number=Sing 3   nmod    3:nmod:from Entity=(145-place-new-cf1-1-coref-China)144)
6   might   might   AUX MD  VerbForm=Fin    8   aux 8:aux   _
7   be  be  AUX VB  VerbForm=Inf    8   aux:pass    8:aux:pass  _
8   annoyed annoy   VERB    VBN Tense=Past|VerbForm=Part|Voice=Pass 0   root    0:root  _
9   by  by  ADP IN  _   12  case    12:case _
10  the the DET DT  Definite=Def|PronType=Art   12  det 12:det  Entity=(146-event-new-cf6-3-sgl
11  constant    constant    ADJ JJ  Degree=Pos  12  amod    12:amod _
12  interruptions   interruption    NOUN    NNS Number=Plur 8   obl:agent   8:obl:agent _
13  in  in  ADP IN  _   14  case    14:case _
14  class   class   NOUN    NN  Number=Sing 12  nmod    12:nmod:in  Entity=(147-abstract-new-cf7-1-sgl)146)
15  as  as  SCONJ   IN  _   18  mark    18:mark Discourse=context-circumstance:77->76:0
16  other   other   ADJ JJ  Degree=Pos  17  amod    17:amod Entity=(148-person-new-cf4-2-sgl
17  students    student NOUN    NNS Number=Plur 18  nsubj   18:nsubj    Entity=148)
18  ask ask VERB    VBP Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin   12  acl 12:acl:as   Entity=(149-event-new-cf2-1-disc
19  questions   question    NOUN    NNS Number=Plur 18  obj 18:obj  Entity=(150-abstract-new-cf5-1-sgl)149)|SpaceAfter=No|XML=<w>
20  —   —   PUNCT   :   _   22  punct   22:punct    Discourse=elaboration-additional:78->77:0|SpaceAfter=No
21  a   a   DET DT  Definite=Ind|PronType=Art   22  det 22:det  Entity=(149-event-giv:act-cf2-2-coref|XML=</w>
22  practice    practice    NOUN    NN  Number=Sing 12  appos   12:appos|25:mark    _
23  that    that    SCONJ   WDT PronType=Rel    25  mark    22:ref  Discourse=elaboration-attribute:79->78:0
24  is  be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   25  aux:pass    25:aux:pass _
25  considered  consider    VERB    VBN Tense=Past|VerbForm=Part|Voice=Pass 22  acl:relcl   22:acl:relcl    _
26  rude    rude    ADJ JJ  Degree=Pos  25  xcomp   25:xcomp    _
27  in  in  ADP IN  _   28  case    28:case _
28  China   China   PROPN   NNP Number=Sing 25  obl 25:obl:in   Entity=(145-place-giv:act-cf1-1-coref-China)149)|SpaceAfter=No
29  .   .   PUNCT   .   _   8   punct   8:punct _

Simiarly, in AKA every vegetarian thing that Ash has in her fridge that will remotely go well together., the second that should be an nsubj rather than mark.

# sent_id = GUM_vlog_college-65
# s_prominence = 3
# s_type = other
# speaker = AshleyClaire
# transition = establishment
# text = AKA every vegetarian thing that Ash has in her fridge that will remotely go well together.
1   AKA a.k.a.  ADV RB  Degree=Pos  4   discourse   4:discourse Discourse=elaboration-attribute:109->108:0
2   every   every   DET DT  PronType=Tot    4   det 4:det   Entity=(67-substance-giv:act-cf2*-3-coref
3   vegetarian  vegetarian  ADJ JJ  Degree=Pos  4   amod    4:amod  _
4   thing   thing   NOUN    NN  Number=Sing 0   root    0:root|7:obj|14:mark    _
5   that    that    PRON    WDT PronType=Rel    7   obj 4:ref   Discourse=elaboration-attribute:110->109:0
6   Ash Ash PROPN   NNP Number=Sing 7   nsubj   7:nsubj Entity=(3-person-giv:inact-cf1-1-coref)
7   has have    VERB    VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   4   acl:relcl   4:acl:relcl _
8   in  in  ADP IN  _   10  case    10:case _
9   her her PRON    PRP$    Case=Gen|Gender=Fem|Number=Sing|Person=3|Poss=Yes|PronType=Prs  10  nmod:poss   10:nmod:poss    Entity=(79-object-new-cf3-2-sgl(3-person-giv:act-cf1-1-ana)
10  fridge  fridge  NOUN    NN  Number=Sing 7   obl 7:obl:in    Entity=79)
11  that    that    SCONJ   WDT PronType=Rel    14  mark    4:ref   Discourse=elaboration-attribute:111->109:1
12  will    will    AUX MD  VerbForm=Fin    14  aux 14:aux  _
13  remotely    remotely    ADV RB  Degree=Pos  14  advmod  14:advmod   _
14  go  go  VERB    VB  VerbForm=Inf    4   acl:relcl   4:acl:relcl _
15  well    well    ADV RB  Degree=Pos  14  advmod  14:advmod   _
16  together    together    ADV RB  Degree=Pos  14  advmod  14:advmod   Entity=67)|SpaceAfter=No
17  .   .   PUNCT   .   _   4   punct   4:punct _

When he became chairman of this city committee, there were 300,000 more Republicans registered in the city of Philadelphia than Democrats, and it is a source of satisfaction to me that tonight there are 260,000 more Democrats registered. It seems that the clause after the coordination is not a relative clause?

# sent_id = GUM_speech_remarks-4
# s_prominence = 3
# s_type = decl
# speaker = JohnFKennedy
# transition = smooth-shift
# text = When he became chairman of this city committee, there were 300,000 more Republicans registered in the city of Philadelphia than Democrats, and it is a source of satisfaction to me that tonight there are 260,000 more Democrats registered.
1   When    when    ADV WRB PronType=Int    3   advmod  3:advmod    Discourse=context-circumstance:8->9:1
2   he  he  PRON    PRP Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs  3   nsubj   3:nsubj|4:nsubj:xsubj   Entity=(11-person-giv:act-cf1*-1-ana-William_J._Green_Jr.)
3   became  become  VERB    VBD Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin   11  advcl   11:advcl:when   _
4   chairman    chairman    NOUN    NN  Number=Sing 3   xcomp   3:xcomp Entity=(11-person-giv:act-cf1*-1-coref-William_J._Green_Jr.
5   of  of  ADP IN  _   8   case    8:case  _
6   this    this    DET DT  Number=Sing|PronType=Dem    8   det 8:det   Entity=(13-organization-new-cf8-3-sgl
7   city    city    NOUN    NN  Number=Sing 8   compound    8:compound  Entity=(2-place-giv:act-cf3-1-coref-Philadelphia)
8   committee   committee   NOUN    NN  Number=Sing 4   nmod    4:nmod:of   Entity=13)11)|SpaceAfter=No
9   ,   ,   PUNCT   ,   _   3   punct   3:punct _
10  there   there   PRON    EX  PronType=Dem    11  expl    11:expl Discourse=context-background:9->7:0
11  were    be  VERB    VBD Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin   0   root    0:root  _
12  300,000 300000  NUM CD  NumForm=Digit|NumType=Card  13  nummod  13:nummod   Entity=(14-person-new-cf6-3-sgl
13  more    more    ADJ JJR Degree=Cmp  14  amod    14:amod _
14  Republicans Republican  PROPN   NNPS    Number=Plur 11  nsubj   11:nsubj    _
15  registered  register    VERB    VBN Tense=Past|VerbForm=Part    14  acl 14:acl  _
16  in  in  ADP IN  _   18  case    18:case _
17  the the DET DT  Definite=Def|PronType=Art   18  det 18:det  Entity=(2-place-giv:act-cf3-2-coref-Philadelphia
18  city    city    NOUN    NN  Number=Sing 15  obl 15:obl:in   _
19  of  of  ADP IN  _   20  case    20:case _
20  Philadelphia    Philadelphia    PROPN   NNP Number=Sing 18  nmod    18:nmod:of  Entity=2)14)
21  than    than    ADP IN  _   22  case    22:case _
22  Democrats   Democrat    PROPN   NNPS    Number=Plur 14  obl 14:obl:than Entity=(15-person-new-cf9-1-sgl)|SpaceAfter=No
23  ,   ,   PUNCT   ,   _   28  punct   28:punct    _
24  and and CCONJ   CC  _   28  cc  28:cc   Discourse=joint-list_m:10->9:0
25  it  it  PRON    PRP Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs  28  nsubj   28:nsubj    Entity=(16-abstract-acc:com-cf4-1-ana)
26  is  be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   28  cop 28:cop  _
27  a   a   DET DT  Definite=Ind|PronType=Art   28  det 28:det  Entity=(16-abstract-giv:act-cf4-2-pred
28  source  source  NOUN    NN  Number=Sing 11  conj    11:conj:and _
29  of  of  ADP IN  _   30  case    30:case _
30  satisfaction    satisfaction    NOUN    NN  Number=Sing 28  nmod    28:nmod:of  Entity=(17-abstract-new-cf10-1-sgl)
31  to  to  ADP IN  _   32  case    32:case _
32  me  I   PRON    PRP Case=Acc|Number=Sing|Person=1|PronType=Prs  28  nmod    28:nmod:to  Entity=(1-person-giv:act-cf2-1-ana-John_F._Kennedy)16)
33  that    that    SCONJ   IN  _   36  mark    36:mark Entity=(16-abstract-giv:act-cf4-4-disc
34  tonight tonight NOUN    NN  Number=Sing 36  obl:tmod    36:obl:tmod Entity=(18-time-acc:com-cf5-1-sgl)
35  there   there   PRON    EX  PronType=Dem    36  expl    36:expl _
36  are be  VERB    VBP Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin   28  acl:relcl   28:acl:relcl    _
37  260,000 260000  NUM CD  NumForm=Digit|NumType=Card  38  compound    38:compound Entity=(19-person-new-cf7-3-sgl
38  more    more    ADJ JJR Degree=Cmp  39  amod    39:amod _
39  Democrats   Democrat    PROPN   NNPS    Number=Plur 36  nsubj   36:nsubj    Entity=19)
40  registered  register    VERB    VBN Tense=Past|VerbForm=Part    39  acl 39:acl  Entity=16)|SpaceAfter=No
41  .   .   PUNCT   .   _   11  punct   11:punct    _

Thanks!

xiulinyang commented 11 months ago

whose misses the ref edge.

# sent_id = GUM_fiction_wedding-9
# s_prominence = 3
# s_type = decl
# transition = smooth-shift
# text = The actual author is Johann Valentin Andreae, whose name didn’t appear on the book originally, thus ensuring the confusion.
1   The the DET DT  Definite=Def|PronType=Art   3   det 3:det   Discourse=adversative-contrast_m:17->16:0|Entity=(7-person-giv:inact-cf2-3-coref-Johannes_Valentinus_Andreae
2   actual  actual  ADJ JJ  Degree=Pos  3   amod    3:amod  _
3   author  author  NOUN    NN  Number=Sing 5   nsubj   5:nsubj Entity=7)
4   is  be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   5   cop 5:cop   _
5   Johann  Johann  PROPN   NNP Number=Sing 0   root    0:root  Entity=(7-person-giv:act-cf2-1,2,3-coref-Johannes_Valentinus_Andreae
6   Valentin    Valentin    PROPN   NNP Number=Sing 5   flat    5:flat  _
7   Andreae Andreae PROPN   NNP Number=Sing 5   flat    5:flat  SpaceAfter=No
8   ,   ,   PUNCT   ,   _   13  punct   13:punct    _
9   whose   whose   PRON    WP$ Poss=Yes|PronType=Rel   10  nmod:poss   10:nmod:poss    Discourse=elaboration-attribute:18->17:0|Entity=(21-abstract-new-cf3-2-sgl
10  name    name    NOUN    NN  Number=Sing 13  nsubj   13:nsubj    Entity=21)
11-12   didn’t  _   _   _   _   _   _   _   _
11  did do  AUX VBD Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin   13  aux 13:aux  _
12  n’t not PART    RB  Polarity=Neg    13  advmod  13:advmod   _
13  appear  appear  VERB    VB  VerbForm=Inf    5   acl:relcl   5:acl:relcl _
14  on  on  ADP IN  _   16  case    16:case _
15  the the DET DT  Definite=Def|PronType=Art   16  det 16:det  Entity=(1-abstract-giv:act-cf1*-2-coref-Chymical_Wedding_of_Christian_Rosenkreutz
16  book    book    NOUN    NN  Number=Sing 13  obl 13:obl:on   Entity=1)
17  originally  originally  ADV RB  Degree=Pos  13  advmod  13:advmod   SpaceAfter=No
18  ,   ,   PUNCT   ,   _   20  punct   20:punct    _
19  thus    thus    ADV RB  _   20  advmod  20:advmod   Discourse=causal-result:19->18:0
20  ensuring    ensure  VERB    VBG VerbForm=Ger    13  advcl   13:advcl    _
21  the the DET DT  Definite=Def|PronType=Art   22  det 22:det  Entity=(22-abstract-new-cf4-2-sgl
22  confusion   confusion   NOUN    NN  Number=Sing 20  obj 20:obj  Entity=22)7)|SpaceAfter=No
23  .   .   PUNCT   .   _   5   punct   5:punct _
amir-zeldes commented 11 months ago

Thanks for reporting, will fix!