UniversalDependencies / UD_English-EWT

English data
Creative Commons Attribution Share Alike 4.0 International
199 stars 42 forks source link

Tagging errors for NOUN/ADJ #206

Closed muchang closed 3 years ago

muchang commented 3 years ago

Hi, the followings are the suspicious cases that tag NOUN as ADJ.

Word: "contrary"

# sent_id = answers-20111108082831AAco5PI_ans-0017
# text = But the articles I read seemed to say the contrary.
1   But but CCONJ   CC  _   6   cc  6:cc    _
2   the the DET DT  Definite=Def|PronType=Art   3   det 3:det   _
3   articles    article NOUN    NNS Number=Plur 6   nsubj   6:nsubj|8:nsubj:xsubj   _
4   I   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  5   nsubj   5:nsubj _
5   read    read    VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    3   acl:relcl   3:acl:relcl _
6   seemed  seem    VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    0   root    0:root  _
7   to  to  PART    TO  _   8   mark    8:mark  _
8   say say VERB    VB  VerbForm=Inf    6   xcomp   6:xcomp _
9   the the DET DT  Definite=Def|PronType=Art   10  det 10:det  _
10  contrary    contrary    ADJ JJ  Degree=Pos  8   obj 8:obj   SpaceAfter=No
11  .   .   PUNCT   .   _   6   punct   6:punct _

Word: "military"

# sent_id = answers-20111107092617AAgKm4X_ans-0030
# text = Instead he pulled the military off, stopped the bombing of North Vietnam, withdrew from the presidential race, and spent the rest of his term in office trying to surrender to North Vietnam.
1   Instead instead ADV RB  _   3   advmod  3:advmod    _
2   he  he  PRON    PRP Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs  3   nsubj   3:nsubj|8:nsubj|15:nsubj|22:nsubj   _
3   pulled  pull    VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    0   root    0:root  _
4   the the DET DT  Definite=Def|PronType=Art   5   det 5:det   _
5   military    military    ADJ JJ  Degree=Pos  3   obj 3:obj   _
6   off off ADP RP  _   3   compound:prt    3:compound:prt  SpaceAfter=No
7   ,   ,   PUNCT   ,   _   8   punct   8:punct _
8   stopped stop    VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    3   conj    3:conj:and  _
9   the the DET DT  Definite=Def|PronType=Art   10  det 10:det  _
10  bombing bombing NOUN    NN  Number=Sing 8   obj 8:obj   _
11  of  of  ADP IN  _   13  case    13:case _
12  North   North   PROPN   NNP Number=Sing 13  compound    13:compound _
13  Vietnam Vietnam PROPN   NNP Number=Sing 10  nmod    10:nmod:of  SpaceAfter=No
14  ,   ,   PUNCT   ,   _   15  punct   15:punct    _
15  withdrew    withdraw    VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    3   conj    3:conj:and  _
16  from    from    ADP IN  _   19  case    19:case _
17  the the DET DT  Definite=Def|PronType=Art   19  det 19:det  _
18  presidential    presidential    ADJ JJ  Degree=Pos  19  amod    19:amod _
19  race    race    NOUN    NN  Number=Sing 15  obl 15:obl:from SpaceAfter=No
20  ,   ,   PUNCT   ,   _   22  punct   22:punct    _
21  and and CCONJ   CC  _   22  cc  22:cc   _
22  spent   spend   VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    3   conj    3:conj:and  _
23  the the DET DT  Definite=Def|PronType=Art   24  det 24:det  _
24  rest    rest    NOUN    NN  Number=Sing 22  obj 22:obj  _
25  of  of  ADP IN  _   27  case    27:case _
26  his he  PRON    PRP$    Gender=Masc|Number=Sing|Person=3|Poss=Yes|PronType=Prs  27  nmod:poss   27:nmod:poss    _
27  term    term    NOUN    NN  Number=Sing 24  nmod    24:nmod:of  _
28  in  in  ADP IN  _   29  case    29:case _
29  office  office  NOUN    NN  Number=Sing 27  nmod    27:nmod:in  _
30  trying  try VERB    VBG VerbForm=Ger    22  advcl   22:advcl    _
31  to  to  PART    TO  _   32  mark    32:mark _
32  surrender   surrender   VERB    VB  VerbForm=Inf    30  xcomp   30:xcomp    _
33  to  to  ADP IN  _   35  case    35:case _
34  North   North   PROPN   NNP Number=Sing 35  compound    35:compound _
35  Vietnam Vietnam PROPN   NNP Number=Sing 32  obl 32:obl:to   SpaceAfter=No
36  .   .   PUNCT   .   _   3   punct   3:punct _

Word: "nerve" (Nerve can only be NOUN and VERB according to the dictionary)

# sent_id = answers-20111108104724AAuBUR7_ans-0007
# text = I chalked it up to possible nerve damage, because this is what the vet thought and the other cheek muscle was fine and he had no theeth problems.
1   I   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  2   nsubj   2:nsubj _
2   chalked chalk   VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    0   root    0:root  _
3   it  it  PRON    PRP Case=Acc|Gender=Neut|Number=Sing|Person=3|PronType=Prs  2   obj 2:obj   _
4   up  up  ADP RP  _   2   compound:prt    2:compound:prt  _
5   to  to  ADP IN  _   8   case    8:case  _
6   possible    possible    ADJ JJ  Degree=Pos  8   amod    8:amod  _
7   nerve   nerve   ADJ JJ  Degree=Pos  8   amod    8:amod  _
8   damage  damage  NOUN    NN  Number=Sing 2   obl 2:obl:to    SpaceAfter=No
9   ,   ,   PUNCT   ,   _   2   punct   2:punct _
10  because because SCONJ   IN  _   13  mark    13:mark _
11  this    this    PRON    DT  Number=Sing|PronType=Dem    13  nsubj   13:nsubj    _
12  is  be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   13  cop 13:cop  _|CheckUPOS=VERB
13  what    what    PRON    WP  PronType=Int    2   advcl   2:advcl:because _
14  the the DET DT  Definite=Def|PronType=Art   15  det 15:det  _
15  vet vet NOUN    NN  Number=Sing 16  nsubj   16:nsubj    _
16  thought think   VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    13  acl:relcl   13:acl:relcl    _
17  and and CCONJ   CC  _   23  cc  23:cc   _
18  the the DET DT  Definite=Def|PronType=Art   21  det 21:det  _
19  other   other   ADJ JJ  Degree=Pos  21  amod    21:amod _
20  cheek   cheek   NOUN    NN  Number=Sing 21  compound    21:compound _
21  muscle  muscle  NOUN    NN  Number=Sing 23  nsubj   23:nsubj    _
22  was be  AUX VBD Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin   23  cop 23:cop  _
23  fine    fine    ADJ JJ  Degree=Pos  13  conj    2:advcl:because|13:conj:and _
24  and and CCONJ   CC  _   26  cc  26:cc   _
25  he  he  PRON    PRP Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs  26  nsubj   26:nsubj    _
26  had have    VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    13  conj    2:advcl:because|13:conj:and _
27  no  no  DET DT  _   29  det 29:det  _
28  theeth  theeth  NOUN    NN  Number=Sing 29  compound    29:compound _
29  problems    problem NOUN    NNS Number=Plur 26  obj 26:obj  SpaceAfter=No
30  .   .   PUNCT   .   _   2   punct   2:punct _

Word: "lab"

# sent_id = answers-20111108104724AAuBUR7_ans-0085
# text = I believe they have a real study testing the lab work in real world situations now, but it may be sometime before those results are released.
1   I   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  2   nsubj   2:nsubj _
2   believe believe VERB    VBP Mood=Ind|Tense=Pres|VerbForm=Fin    0   root    0:root  _
3   they    they    PRON    PRP Case=Nom|Number=Plur|Person=3|PronType=Prs  4   nsubj   4:nsubj _
4   have    have    VERB    VBP Mood=Ind|Tense=Pres|VerbForm=Fin    2   ccomp   2:ccomp _
5   a   a   DET DT  Definite=Ind|PronType=Art   7   det 7:det   _
6   real    real    ADJ JJ  Degree=Pos  7   amod    7:amod  _
7   study   study   NOUN    NN  Number=Sing 4   obj 4:obj   _
8   testing test    VERB    VBG VerbForm=Ger    7   acl 7:acl   _
9   the the DET DT  Definite=Def|PronType=Art   11  det 11:det  _
10  lab lab NOUN    NN  Number=Sing 11  compound    11:compound _
11  work    work    NOUN    NN  Number=Sing 8   obj 8:obj   _
12  in  in  ADP IN  _   15  case    15:case _
13  real    real    ADJ JJ  Degree=Pos  14  amod    14:amod _
14  world   world   NOUN    NN  Number=Sing 15  compound    15:compound _
15  situations  situation   NOUN    NNS Number=Plur 8   obl 8:obl:in    _
16  now now ADV RB  _   4   advmod  4:advmod    SpaceAfter=No
17  ,   ,   PUNCT   ,   _   23  punct   23:punct    _
18  but but CCONJ   CC  _   23  cc  23:cc   _
19  it  it  PRON    PRP Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs  23  nsubj   23:nsubj    _
20  may may AUX MD  VerbForm=Fin    23  aux 23:aux  _
21  be  be  AUX VB  VerbForm=Inf    23  cop 23:cop  _
22  some    some    DET DT  _   23  det 23:det  SpaceAfter=No|CorrectSpaceAfter=Yes
23  time    time    NOUN    NN  Number=Sing 2   conj    2:conj:but  _
24  before  before  SCONJ   IN  _   28  mark    28:mark _
25  those   that    DET DT  Number=Plur|PronType=Dem    26  det 26:det  _
26  results result  NOUN    NNS Number=Plur 28  nsubj:pass  28:nsubj:pass   _
27  are be  AUX VBP Mood=Ind|Tense=Pres|VerbForm=Fin    28  aux:pass    28:aux:pass _
28  released    release VERB    VBN Tense=Past|VerbForm=Part|Voice=Pass 23  advcl   23:advcl:before SpaceAfter=No
29  .   .   PUNCT   .   _   2   punct   2:punct _

Word: "lab"

# sent_id = reviews-154157-0005
# text = Going back to have some lab coats done this weekend!
1   Going   go  VERB    VBG VerbForm=Ger    0   root    0:root  _
2   back    back    ADV RB  _   1   advmod  1:advmod    _
3   to  to  PART    TO  _   4   mark    4:mark  _
4   have    have    VERB    VB  VerbForm=Inf    1   advcl   1:advcl:to  _
5   some    some    DET DT  _   7   det 7:det   _
6   lab lab NOUN    NN  Number=Sing 7   compound    7:compound  _
7   coats   coat    NOUN    NNS Number=Plur 4   obj 4:obj|8:nsubj:xsubj _
8   done    do  VERB    VBN Tense=Past|VerbForm=Part    4   xcomp   4:xcomp _
9   this    this    DET DT  Number=Sing|PronType=Dem    10  det 10:det  _
10  weekend weekend NOUN    NN  Number=Sing 1   obl:tmod    1:obl:tmod  SpaceAfter=No
11  !   !   PUNCT   .   _   1   punct   1:punct _

Word: "rave" (Rave can only be NOUN and VERB according to the dictionary)

# sent_id = reviews-263870-0009
# text = It is the best university in Virginia and continuously receives rave reviews every year.
1   It  it  PRON    PRP Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs  5   nsubj   5:nsubj|10:nsubj    _
2   is  be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   5   cop 5:cop   _
3   the the DET DT  Definite=Def|PronType=Art   5   det 5:det   _
4   best    good    ADJ JJS Degree=Sup  5   amod    5:amod  _
5   university  university  NOUN    NN  Number=Sing 0   root    0:root  _
6   in  in  ADP IN  _   7   case    7:case  _
7   Virginia    Virginia    PROPN   NNP Number=Sing 5   nmod    5:nmod:in   _
8   and and CCONJ   CC  _   10  cc  10:cc   _
9   continuously    continuously    ADV RB  _   10  advmod  10:advmod   _
10  receives    receive VERB    VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   5   conj    5:conj:and  _
11  rave    rave    ADJ JJ  Degree=Pos  12  amod    12:amod _
12  reviews review  NOUN    NNS Number=Plur 10  obj 10:obj  _
13  every   every   DET DT  _   14  det 14:det  _
14  year    year    NOUN    NN  Number=Sing 10  obl:tmod    10:obl:tmod SpaceAfter=No
15  .   .   PUNCT   .   _   5   punct   5:punct _

Words: "pink" and "brown"

# sent_id = answers-20111108092643AAXe4lD_ans-0035
# text = The girl has either pink or brown.
1   The the DET DT  Definite=Def|PronType=Art   2   det 2:det   _
2   girl    girl    NOUN    NN  Number=Sing 3   nsubj   3:nsubj _
3   has have    VERB    VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   0   root    0:root  _
4   either  either  CCONJ   CC  _   5   cc:preconj  5:cc:preconj    _
5   pink    pink    ADJ JJ  Degree=Pos  3   obj 3:obj   _
6   or  or  CCONJ   CC  _   7   cc  7:cc    _
7   brown   brown   ADJ JJ  Degree=Pos  5   conj    3:obj|5:conj:or SpaceAfter=No
8   .   .   PUNCT   .   _   3   punct   3:punct _

Word: "subject"

# sent_id = email-enronsent12_01-0034
# text = I also deleted "subject to recall" based on our discussion, assuming you get approval for that revision.
1   I   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  3   nsubj   3:nsubj _
2   also    also    ADV RB  _   3   advmod  3:advmod    _
3   deleted delete  VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    0   root    0:root  _
4   "   "   PUNCT   ``  _   5   punct   5:punct SpaceAfter=No
5   subject subject ADJ JJ  Degree=Pos  3   obj 3:obj   _
6   to  to  ADP IN  _   7   case    7:case  _
7   recall  recall  NOUN    NN  Number=Sing 5   obl 5:obl:to    SpaceAfter=No
8   "   "   PUNCT   ''  _   5   punct   5:punct _
9   based   base    VERB    VBN Tense=Past|VerbForm=Part    12  case    12:case _
10  on  on  ADP IN  _   12  case    12:case _
11  our we  PRON    PRP$    Number=Plur|Person=1|Poss=Yes|PronType=Prs  12  nmod:poss   12:nmod:poss    _
12  discussion  discussion  NOUN    NN  Number=Sing 3   obl 3:obl:on    SpaceAfter=No
13  ,   ,   PUNCT   ,   _   3   punct   3:punct _
14  assuming    assume  VERB    VBG VerbForm=Ger    3   advcl   3:advcl _
15  you you PRON    PRP Case=Nom|Person=2|PronType=Prs  16  nsubj   16:nsubj    _
16  get get VERB    VBP Mood=Ind|Tense=Pres|VerbForm=Fin    14  ccomp   14:ccomp    _
17  approval    approval    NOUN    NN  Number=Sing 16  obj 16:obj  _
18  for for ADP IN  _   20  case    20:case _
19  that    that    DET DT  Number=Sing|PronType=Dem    20  det 20:det  _
20  revision    revision    NOUN    NN  Number=Sing 17  nmod    17:nmod:for SpaceAfter=No
21  .   .   PUNCT   .   _   3   punct   3:punct _

Word: "bella"

# sent_id = reviews-042012-0005
# text = You will also have the experience of learning the bella Sicilian culture, that I have fallen in luv with.
1   You you PRON    PRP Case=Nom|Person=2|PronType=Prs  4   nsubj   4:nsubj _
2   will    will    AUX MD  VerbForm=Fin    4   aux 4:aux   _
3   also    also    ADV RB  _   4   advmod  4:advmod    _
4   have    have    VERB    VB  VerbForm=Inf    0   root    0:root  _
5   the the DET DT  Definite=Def|PronType=Art   6   det 6:det   _
6   experience  experience  NOUN    NN  Number=Sing 4   obj 4:obj   _
7   of  of  SCONJ   IN  _   8   mark    8:mark  _
8   learning    learn   VERB    VBG VerbForm=Ger    6   acl 6:acl:of    _
9   the the DET DT  Definite=Def|PronType=Art   12  det 12:det  _
10  bella   bella   ADJ JJ  Degree=Pos  12  amod    12:amod _
11  Sicilian    Sicilian    ADJ JJ  Degree=Pos  12  amod    12:amod _
12  culture culture NOUN    NN  Number=Sing 8   obj 8:obj|17:obj    SpaceAfter=No
13  ,   ,   PUNCT   ,   _   12  punct   12:punct    _
14  that    that    PRON    WDT PronType=Rel    17  obj 12:ref  _
15  I   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  17  nsubj   17:nsubj    _
16  have    have    AUX VBP Mood=Ind|Tense=Pres|VerbForm=Fin    17  aux 17:aux  _
17  fallen  fall    VERB    VBN Tense=Past|VerbForm=Part    12  acl:relcl   12:acl:relcl    _
18  in  in  ADP IN  _   19  case    19:case _
19  luv love    NOUN    NN  Abbr=Yes|Number=Sing    17  obl 17:obl:in   _
20  with    with    ADP IN  _   19  nmod    19:nmod SpaceAfter=No
21  .   .   PUNCT   .   _   4   punct   4:punct _

Commit: 2f5eba2

muchang commented 3 years ago

Hi, the followings are the suspicious cases that tag ADJ as NOUN.

Word: "arsenic"

# sent_id = answers-20111108101816AAdTp9M_ans-0006
# text = A common one is arsenic and there really isn't a way to cure that.
1   A   a   DET DT  Definite=Ind|PronType=Art   3   det 3:det   _
2   common  common  ADJ JJ  Degree=Pos  3   amod    3:amod  _
3   one one NOUN    NN  Number=Sing 5   nsubj   5:nsubj _
4   is  be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   5   cop 5:cop   _
5   arsenic arsenic NOUN    NN  Number=Sing 0   root    0:root  _
6   and and CCONJ   CC  _   9   cc  9:cc    _
7   there   there   PRON    EX  _   9   expl    9:expl  _
8   really  really  ADV RB  _   9   advmod  9:advmod    _
9-10    isn't   _   _   _   _   _   _   _   _
9   is  be  VERB    VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   5   conj    5:conj:and  _
10  n't not PART    RB  _   9   advmod  9:advmod    _
11  a   a   DET DT  Definite=Ind|PronType=Art   12  det 12:det  _
12  way way NOUN    NN  Number=Sing 9   nsubj   9:nsubj _
13  to  to  PART    TO  _   14  mark    14:mark _
14  cure    cure    VERB    VB  VerbForm=Inf    12  acl 12:acl:to   _
15  that    that    PRON    DT  Number=Sing|PronType=Dem    14  obj 14:obj  SpaceAfter=No
16  .   .   PUNCT   .   _   5   punct   5:punct _

Word: "quality"

# sent_id = answers-20111108110044AA4rs9f_ans-0012
# text = There are a lot of good quality dry cat foods.
1   There   there   PRON    EX  _   2   expl    2:expl  _
2   are be  VERB    VBP Mood=Ind|Tense=Pres|VerbForm=Fin    0   root    0:root  _
3   a   a   DET DT  Definite=Ind|PronType=Art   4   det 4:det   _
4   lot lot NOUN    NN  Number=Sing 2   nsubj   2:nsubj _
5   of  of  ADP IN  _   10  case    10:case _
6   good    good    ADJ JJ  Degree=Pos  7   amod    7:amod  _
7   quality quality NOUN    NN  Number=Sing 10  compound    10:compound _
8   dry dry ADJ JJ  Degree=Pos  10  amod    10:amod _
9   cat cat NOUN    NN  Number=Sing 10  compound    10:compound _
10  foods   food    NOUN    NNS Number=Plur 4   nmod    4:nmod:of   SpaceAfter=No
11  .   .   PUNCT   .   _   2   punct   2:punct _

Word: "routine"

# sent_id = answers-20111108111128AAwfype_ans-0009
# text = i have talked to some people that say that a medical exam is an invasion of privacy and other say its a routine thing when entering another country.
1   i   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  3   nsubj   3:nsubj _
2   have    have    AUX VBP Mood=Ind|Tense=Pres|VerbForm=Fin    3   aux 3:aux   _
3   talked  talk    VERB    VBN Tense=Past|VerbForm=Part    0   root    0:root  _
4   to  to  ADP IN  _   6   case    6:case  _
5   some    some    DET DT  _   6   det 6:det   _
6   people  people  NOUN    NNS Number=Plur 3   obl 3:obl:to|8:nsubj    _
7   that    that    PRON    WDT PronType=Rel    8   nsubj   6:ref   _
8   say say VERB    VBP Mood=Ind|Tense=Pres|VerbForm=Fin    6   acl:relcl   6:acl:relcl _
9   that    that    SCONJ   IN  _   15  mark    15:mark _
10  a   a   DET DT  Definite=Ind|PronType=Art   12  det 12:det  _
11  medical medical ADJ JJ  Degree=Pos  12  amod    12:amod _
12  exam    exam    NOUN    NN  Number=Sing 15  nsubj   15:nsubj    _
13  is  be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   15  cop 15:cop  _
14  an  a   DET DT  Definite=Ind|PronType=Art   15  det 15:det  _
15  invasion    invasion    NOUN    NN  Number=Sing 8   ccomp   8:ccomp _
16  of  of  ADP IN  _   17  case    17:case _
17  privacy privacy NOUN    NN  Number=Sing 15  nmod    15:nmod:of  _
18  and and CCONJ   CC  _   20  cc  20:cc   _
19  other   other   ADJ JJ  Degree=Pos  20  nsubj   20:nsubj    _
20  say say VERB    VBP Mood=Ind|Tense=Pres|VerbForm=Fin    3   conj    3:conj:and  _
21-22   its _   _   _   _   _   _   _   _
21  it  it  PRON    PRP Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs  25  nsubj   25:nsubj    _
22  s   be  AUX VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   25  cop 25:cop  _
23  a   a   DET DT  Definite=Ind|PronType=Art   25  det 25:det  _
24  routine routine NOUN    NN  Number=Sing 25  compound    25:compound _
25  thing   thing   NOUN    NN  Number=Sing 20  ccomp   20:ccomp    _
26  when    when    SCONJ   WRB PronType=Int    27  mark    27:mark _
27  entering    enter   VERB    VBG VerbForm=Ger    25  advcl   25:advcl:when   _
28  another another DET DT  _   29  det 29:det  _
29  country country NOUN    NN  Number=Sing 27  obj 27:obj  SpaceAfter=No
30  .   .   PUNCT   .   _   3   punct   3:punct _

Word: "medium"

# sent_id = answers-20111108100419AAKZvMH_ans-0003
# text = Last time I got a medium rat it was a girl and my snake handled her easily.
1   Last    last    ADJ JJ  Degree=Pos  2   amod    2:amod  _
2   time    time    NOUN    NN  Number=Sing 11  obl:tmod    11:obl:tmod _
3   I   I   PRON    PRP Case=Nom|Number=Sing|Person=1|PronType=Prs  4   nsubj   4:nsubj _
4   got get VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    2   acl:relcl   2:acl:relcl _
5   a   a   DET DT  Definite=Ind|PronType=Art   7   det 7:det   _
6   medium  medium  NOUN    NN  Number=Sing 7   compound    7:compound  _
7   rat rat NOUN    NN  Number=Sing 4   obj 4:obj   _
8   it  it  PRON    PRP Case=Nom|Gender=Neut|Number=Sing|Person=3|PronType=Prs  11  nsubj   11:nsubj    _
9   was be  AUX VBD Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin   11  cop 11:cop  _
10  a   a   DET DT  Definite=Ind|PronType=Art   11  det 11:det  _
11  girl    girl    NOUN    NN  Number=Sing 0   root    0:root  _
12  and and CCONJ   CC  _   15  cc  15:cc   _
13  my  my  PRON    PRP$    Number=Sing|Person=1|Poss=Yes|PronType=Prs  14  nmod:poss   14:nmod:poss    _
14  snake   snake   NOUN    NN  Number=Sing 15  nsubj   15:nsubj    _
15  handled handle  VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    11  conj    11:conj:and _
16  her she PRON    PRP$    Gender=Fem|Number=Sing|Person=3|Poss=Yes|PronType=Prs   15  obj 15:obj  _
17  easily  easily  ADV RB  _   15  advmod  15:advmod   SpaceAfter=No
18  .   .   PUNCT   .   _   11  punct   11:punct    _

Commit: 2f5eba2

nschneid commented 3 years ago


muchang commented 3 years ago

Thanks for your discussion in detail. I agree with you in general. For the "pink and brown", it's indeed not very clear, we could leave it as ADJ. For "the contrary", since "contrary" has a noun sense, I am laying towards tagging it as NOUN.

nschneid commented 3 years ago

There are 2 of these errors for "military" with a determiner: http://match.grew.fr/?corpus=UD_English-EWT@dev&custom=61214563a0034&clustering=X.upos

nschneid commented 3 years ago

@muchang For things we agree are errors, could you please submit pull requests against the dev branch? I can review those.

muchang commented 3 years ago

Here are some more cases that tag ADJ as NOUN.

Word: "minimum"

# sent_id = weblog-blogspot.com_alaindewitt_20040929103700_ENG_20040929_103700-0058
# text = In June and July of 1973, he accumulated 56 points, enough to meet the minimum requirement for the 1973-1974 year.
1   In  in  ADP IN  _   2   case    2:case  _
2   June    June    PROPN   NNP Number=Sing 9   obl 9:obl:in    _
3   and and CCONJ   CC  _   4   cc  4:cc    _
4   July    July    PROPN   NNP Number=Sing 2   conj    2:conj:and|9:obl:in _
5   of  of  ADP IN  _   6   case    6:case  _
6   1973    1973    NUM CD  NumType=Card    2   nmod    2:nmod:of   SpaceAfter=No
7   ,   ,   PUNCT   ,   _   9   punct   9:punct _
8   he  he  PRON    PRP Case=Nom|Gender=Masc|Number=Sing|Person=3|PronType=Prs  9   nsubj   9:nsubj _
9   accumulated accumulate  VERB    VBD Mood=Ind|Tense=Past|VerbForm=Fin    0   root    0:root  _
10  56  56  NUM CD  NumType=Card    11  nummod  11:nummod   _
11  points  point   NOUN    NNS Number=Plur 9   obj 9:obj   SpaceAfter=No
12  ,   ,   PUNCT   ,   _   11  punct   11:punct    _
13  enough  enough  ADJ JJ  Degree=Pos  11  amod    11:amod _
14  to  to  PART    TO  _   15  mark    15:mark _
15  meet    meet    VERB    VB  VerbForm=Inf    13  advcl   13:advcl:to _
16  the the DET DT  Definite=Def|PronType=Art   18  det 18:det  _
17  minimum minimum NOUN    NN  Number=Sing 18  compound    18:compound _
18  requirement requirement NOUN    NN  Number=Sing 15  obj 15:obj  _
19  for for ADP IN  _   24  case    24:case _
20  the the DET DT  Definite=Def|PronType=Art   24  det 24:det  _
21  1973    1973    NUM CD  NumType=Card    24  compound    24:compound SpaceAfter=No
22  -   -   SYM SYM _   23  case    23:case SpaceAfter=No
23  1974    1974    NUM CD  NumType=Card    21  nmod    21:nmod _
24  year    year    NOUN    NN  Number=Sing 18  nmod    18:nmod:for SpaceAfter=No
25  .   .   PUNCT   .   _   9   punct   9:punct _

Word: "teen"

# sent_id = weblog-blogspot.com_rigorousintuition_20060511134300_ENG_20060511_134300-0088
# text = Husbands marry mothers and teen daughters at the same time.
1   Husbands    husband NOUN    NNS Number=Plur 2   nsubj   2:nsubj _
2   marry   marry   VERB    VBP Mood=Ind|Tense=Pres|VerbForm=Fin    0   root    0:root  _
3   mothers mother  NOUN    NNS Number=Plur 2   obj 2:obj   _
4   and and CCONJ   CC  _   6   cc  6:cc    _
5   teen    teen    NOUN    NN  Number=Sing 6   compound    6:compound  _
6   daughters   daughter    NOUN    NNS Number=Plur 3   conj    2:obj|3:conj:and    _
7   at  at  ADP IN  _   10  case    10:case _
8   the the DET DT  Definite=Def|PronType=Art   10  det 10:det  _
9   same    same    ADJ JJ  Degree=Pos  10  amod    10:amod _
10  time    time    NOUN    NN  Number=Sing 2   obl 2:obl:at    SpaceAfter=No
11  .   .   PUNCT   .   _   2   punct   2:punct _

Word: "virtuoso"

# sent_id = weblog-blogspot.com_alaindewitt_20060924104100_ENG_20060924_104100-0225
# text = Its so-called moderates and conservatives play their own virtuoso version of the "good-cop versus bad-cop" game.
1   Its its PRON    PRP$    Gender=Neut|Number=Sing|Person=3|Poss=Yes|PronType=Prs  5   nmod:poss   5:nmod:poss _
2   so  so  ADV RB  _   4   advmod  4:advmod    SpaceAfter=No
3   -   -   PUNCT   HYPH    _   4   punct   4:punct SpaceAfter=No
4   called  call    VERB    VBN Tense=Past|VerbForm=Part    5   amod    5:amod  _
5   moderates   moderate    NOUN    NNS Number=Plur 8   nsubj   8:nsubj _
6   and and CCONJ   CC  _   7   cc  7:cc    _
7   conservatives   conservative    NOUN    NNS Number=Plur 5   conj    5:conj:and|8:nsubj  _
8   play    play    VERB    VBP Mood=Ind|Tense=Pres|VerbForm=Fin    0   root    0:root  _
9   their   they    PRON    PRP$    Number=Plur|Person=3|Poss=Yes|PronType=Prs  12  nmod:poss   12:nmod:poss    _
10  own own ADJ JJ  Degree=Pos  12  amod    12:amod _
11  virtuoso    virtuoso    NOUN    NN  Number=Sing 12  compound    12:compound _
12  version version NOUN    NN  Number=Sing 8   obj 8:obj   _
13  of  of  ADP IN  _   24  case    24:case _
14  the the DET DT  Definite=Def|PronType=Art   24  det 24:det  _
15  "   "   PUNCT   ``  _   18  punct   18:punct    SpaceAfter=No
16  good    good    ADJ JJ  Degree=Pos  18  amod    18:amod SpaceAfter=No
17  -   -   PUNCT   HYPH    _   18  punct   18:punct    SpaceAfter=No
18  cop cop NOUN    NN  Number=Sing 24  compound    24:compound _
19  versus  versus  ADP IN  _   22  case    22:case _
20  bad bad ADJ JJ  Degree=Pos  22  amod    22:amod SpaceAfter=No
21  -   -   PUNCT   HYPH    _   22  punct   22:punct    SpaceAfter=No
22  cop cop NOUN    NN  Number=Sing 18  nmod    18:nmod:versus  SpaceAfter=No
23  "   "   PUNCT   ''  _   18  punct   18:punct    _
24  game    game    NOUN    NN  Number=Sing 12  nmod    12:nmod:of  SpaceAfter=No
25  .   .   PUNCT   .   _   8   punct   8:punct _

Word: "surprise"

# sent_id = newsgroup-groups.google.com_FOOLED_7554c5ce34a5a49e_ENG_20051012_144800-0039
# text = At least one expert believes China's intentions might include a surprise attempt to circle the moon with astronauts before NASA's planned lunar return.
1   At  at  ADP IN  _   2   case    2:case  _
2   least   least   ADJ JJS Degree=Sup  3   nmod    3:nmod:at   _
3   one one NUM CD  NumType=Card    4   nummod  4:nummod    _
4   expert  expert  NOUN    NN  Number=Sing 5   nsubj   5:nsubj _
5   believes    believe VERB    VBZ Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin   0   root    0:root  _
6-7 China's _   _   _   _   _   _   _   _
6   China   China   PROPN   NNP Number=Sing 8   nmod:poss   8:nmod:poss _
7   's  's  PART    POS _   6   case    6:case  _
8   intentions  intention   NOUN    NNS Number=Plur 10  nsubj   10:nsubj    _
9   might   might   AUX MD  VerbForm=Fin    10  aux 10:aux  _
10  include include VERB    VB  VerbForm=Inf    5   ccomp   5:ccomp _
11  a   a   DET DT  Definite=Ind|PronType=Art   13  det 13:det  _
12  surprise    surprise    NOUN    NN  Number=Sing 13  compound    13:compound _
13  attempt attempt NOUN    NN  Number=Sing 10  obj 10:obj  _
14  to  to  PART    TO  _   15  mark    15:mark _
15  circle  circle  VERB    VB  VerbForm=Inf    13  acl 13:acl:to   _
16  the the DET DT  Definite=Def|PronType=Art   17  det 17:det  _
17  moon    moon    NOUN    NN  Number=Sing 15  obj 15:obj  _
18  with    with    ADP IN  _   19  case    19:case _
19  astronauts  astronaut   NOUN    NNS Number=Plur 15  obl 15:obl:with _
20  before  before  ADP IN  _   25  case    25:case _
21-22   NASA's  _   _   _   _   _   _   _   _
21  NASA    NASA    PROPN   NNP Number=Sing 25  nmod:poss   25:nmod:poss    _
22  's  's  PART    POS _   21  case    21:case _
23  planned plan    VERB    VBN Tense=Past|VerbForm=Part    25  amod    25:amod _
24  lunar   lunar   ADJ JJ  Degree=Pos  25  amod    25:amod _
25  return  return  NOUN    NN  Number=Sing 15  obl 15:obl:before   SpaceAfter=No
26  .   .   PUNCT   .   _   5   punct   5:punct _

Word: "reptile"

# sent_id = answers-20111108104350AAp4hGP_ans-0015
# newpar id = answers-20111108104350AAp4hGP_ans-p0004
# text = My general convince your parents to let you get a reptile advice:
1   My  my  PRON    PRP$    Number=Sing|Person=1|Poss=Yes|PronType=Prs  12  nmod:poss   12:nmod:poss    _
2   general general ADJ JJ  Degree=Pos  12  amod    12:amod _
3   convince    convince    VERB    VB  Mood=Imp|VerbForm=Fin   12  compound    12:compound _
4   your    you PRON    PRP$    Person=2|Poss=Yes|PronType=Prs  5   nmod:poss   5:nmod:poss _
5   parents parent  NOUN    NNS Number=Plur 3   obj 3:obj|7:nsubj:xsubj _
6   to  to  PART    TO  _   7   mark    7:mark  _
7   let let VERB    VB  VerbForm=Inf    3   xcomp   3:xcomp _
8   you you PRON    PRP Case=Nom|Person=2|PronType=Prs  7   obj 7:obj|9:nsubj:xsubj _
9   get get VERB    VB  VerbForm=Inf    7   xcomp   7:xcomp _
10  a   a   DET DT  Definite=Ind|PronType=Art   11  det 11:det  _
11  reptile reptile NOUN    NN  Number=Sing 9   obj 9:obj   _
12  advice  advice  NOUN    NN  Number=Sing 0   root    0:root  SpaceAfter=No
13  :   :   PUNCT   :   _   12  punct   12:punct    _

Word: "Gold"

# sent_id = reviews-080221-0012
# text = Gold award parts excellence, metro.
1   Gold    gold    NOUN    NN  Number=Sing 2   compound    2:compound  _
2   award   award   NOUN    NN  Number=Sing 0   root    0:root  _
3   parts   part    NOUN    NNS Number=Plur 4   compound    4:compound  _
4   excellence  excellence  NOUN    NN  Number=Sing 2   compound    2:compound  SpaceAfter=No
5   ,   ,   PUNCT   ,   _   2   punct   2:punct _
6   metro   metro   PROPN   NNP Number=Sing 2   list    2:list  SpaceAfter=No
7   .   .   PUNCT   .   _   2   punct   2:punct _
muchang commented 3 years ago

For things we agree are errors, could you please submit pull requests against the dev branch? I can review those.

Sure, could you please tell me how to systematically fix the issues rather than just correct them in .conllu?

nschneid commented 3 years ago

Specific instances need to be edited in .conllu. If there's a systematic fix that can be made by rule, @amir-zeldes's DepEdit tool can help.

muchang commented 3 years ago

I see, I will first edit in .conllu and see.

nschneid commented 3 years ago
muchang commented 3 years ago

Thanks, Nathan. I agree with your opinions. I have fixed the first three (minimum, teen, virtuoso).