muchang closed this issue 3 years ago
Definitely, these are errors. Thanks!
Thanks, Amir. However, I am not sure about the following case:
# sent_id = GUM_bio_jerome-2
# s_type = decl
# text = Jerome (/ dʒəˈroʊm /; Latin: Eusebius Sophronius Hieronymus; Greek: Εὐσέβιος Σωφρόνιος Ἱερώνυμος; c. 347 – 30 September 420) was a Latin Catholic priest, confessor, theologian, and historian, commonly known as Saint Jerome.
1 Jerome Jerome PROPN NNP Number=Sing 30 nsubj 30:nsubj|32:nsubj|34:nsubj|37:nsubj Discourse=ROOT:2|Entity=(person-1-Jerome)
2 ( ( PUNCT -LRB- _ 4 punct 4:punct Discourse=elaboration:3->2|SpaceAfter=No
3 / / PUNCT SYM _ 4 punct 4:punct _
4 dʒəˈroʊm dʒəˈroʊm PROPN NNP Number=Sing 1 appos 1:appos Entity=(person-1-Jerome)
5 / / PUNCT SYM _ 9 punct 9:punct SpaceAfter=No
6 ; ; PUNCT : _ 9 punct 9:punct _
7 Latin Latin PROPN NNP Number=Sing 9 nmod 9:nmod Discourse=preparation:4->5|Entity=(abstract-2-Latin)|SpaceAfter=No
8 : : PUNCT : _ 7 punct 7:punct _
9 Eusebius Eusebius PROPN NNP Number=Sing 1 appos 1:appos Discourse=joint:5->3|Entity=(person-1-Jerome
10 Sophronius Sophronius PROPN NNP Number=Sing 9 flat 9:flat _
11 Hieronymus Hieronymus PROPN NNP Number=Sing 9 flat 9:flat SpaceAfter=No
12 ; ; PUNCT : _ 15 punct 15:punct Entity=person-1-Jerome)
13 Greek Greek PROPN NNP Number=Sing 15 nmod 15:nmod Discourse=preparation:6->7|Entity=(abstract-3-Greek_language)|SpaceAfter=No
14 : : PUNCT : _ 13 punct 13:punct _
15 Εὐσέβιος Εὐσέβιος PROPN NNP Number=Sing 1 appos 1:appos Discourse=joint:7->3|Entity=(person-1-Jerome
16 Σωφρόνιος Σωφρόνιος PROPN NNP Number=Sing 15 flat 15:flat _
17 Ἱερώνυμος Ἱερώνυμος PROPN NNP Number=Sing 15 flat 15:flat Entity=person-1-Jerome)|SpaceAfter=No
18 ; ; PUNCT : _ 20 punct 20:punct _
19 c. c. ADV FW Abbr=Yes 20 advmod 20:advmod Discourse=joint:8->3
20 347 347 NUM CD NumForm=Digit|NumType=Card 1 nmod:tmod 1:nmod:tmod Entity=(time-4)
21 – - SYM SYM _ 22 case 22:case _
22 30 30 NUM CD NumForm=Digit|NumType=Card 20 nmod 20:nmod:to Entity=(time-5
23 September September PROPN NNP Number=Sing 22 compound 22:compound Entity=(time-6
24 420 420 NUM CD NumForm=Digit|NumType=Card 22 nmod:tmod 22:nmod:tmod Entity=(time-7)time-5)time-6)|SpaceAfter=No
25 ) ) PUNCT -RRB- _ 20 punct 20:punct _
26 was be AUX VBD Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin 30 cop 30:cop Discourse=same-unit:9->2
27 a a DET DT Definite=Ind|PronType=Art 30 det 30:det Entity=(person-1-Jerome
28 Latin Latin ADJ JJ Degree=Pos 30 amod 30:amod _
29 Catholic Catholic ADJ JJ Degree=Pos 30 amod 30:amod _
30 priest priest NOUN NN Number=Sing 0 root 0:root SpaceAfter=No
31 , , PUNCT , _ 32 punct 32:punct _
32 confessor confessor NOUN NN Number=Sing 30 conj 30:conj:and SpaceAfter=No
33 , , PUNCT , _ 34 punct 34:punct _
34 theologian theologian NOUN NN Number=Sing 30 conj 30:conj:and SpaceAfter=No
35 , , PUNCT , _ 37 punct 37:punct _
36 and and CCONJ CC _ 37 cc 37:cc _
37 historian historian NOUN NN Number=Sing 30 conj 30:conj:and Entity=person-1-Jerome)|SpaceAfter=No
38 , , PUNCT , _ 40 punct 40:punct _
39 commonly commonly ADV RB Degree=Pos 40 advmod 40:advmod Discourse=elaboration:10->2
40 known know VERB VBN Tense=Past|VerbForm=Part 30 acl 30:acl _
41 as as ADP IN _ 42 case 42:case _
42 Saint Saint PROPN NNP Number=Sing 40 obl 40:obl:as Entity=(person-1-Jerome
43 Jerome Jerome PROPN NNP Number=Sing 42 flat 42:flat Entity=person-1-Jerome)|SpaceAfter=No
44 . . PUNCT . _ 30 punct 30:punct _
The "/" in the following sentence seems to be SYM as well.
I think here it's just a graphic device delimiting a phonological transcription, so I would say it's PUNCT. This way its deprel can be punct. Otherwise, what would the deprel be? The UD guidelines suggest that SYM "can be substituted by normal words", which I don't think is the case here: https://universaldependencies.org/u/pos/SYM.html
Yeah, I agree with you.
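For anyone who wants to check this pairing mechanically, here is a minimal sketch. It assumes tab-separated CoNLL-U with the standard ten columns, and it assumes the convention discussed above: a "/" token should be UPOS PUNCT exactly when its deprel is punct. The function name `slash_mismatches` is made up for illustration and is not part of any GUM or UD tooling.

```python
def slash_mismatches(conllu_text, form="/"):
    """Return (token_id, upos, deprel) for every `form` token where
    exactly one of `upos == "PUNCT"` and `deprel == "punct"` holds."""
    mismatches = []
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines (# sent_id = ...).
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Standard CoNLL-U: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
        if len(cols) != 10 or cols[1] != form:
            continue
        token_id, upos, deprel = cols[0], cols[3], cols[7]
        if (upos == "PUNCT") != (deprel == "punct"):
            mismatches.append((token_id, upos, deprel))
    return mismatches
```

An empty result means every "/" in the input is either PUNCT with punct or neither. Anything returned is a candidate for manual review, not necessarily an error.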
In GUM, most "/" tokens are tagged as SYM, while the following three are tagged as PUNCT.
Commit: d38df82
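In case it helps to reproduce the SYM vs. PUNCT tally for "/", here is a rough sketch. It assumes tab-separated CoNLL-U text has already been read into a string. The helper name `upos_counts` is hypothetical, not something from the GUM build scripts.

```python
from collections import Counter


def upos_counts(conllu_text, form="/"):
    """Count how often each UPOS tag is assigned to tokens whose FORM is `form`."""
    counts = Counter()
    for line in conllu_text.splitlines():
        # Skip blank sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Column 2 is FORM and column 4 is UPOS in the ten-column layout.
        if len(cols) == 10 and cols[1] == form:
            counts[cols[3]] += 1
    return counts
```

Running this over the concatenated corpus files and printing `upos_counts(text).most_common()` should show how the tags are distributed. The actual numbers depend on the corpus version.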