calzada / PARLAMINT-ES-MC

2 stars 4 forks source link

bugs in linguistic annotations #49

Closed matyaskopp closed 1 year ago

matyaskopp commented 1 year ago
ERROR ParlaMint-ES_2015-04-14-CD150414.ana: ERROR: Can't find local id for link/@target="#ParlaMint-ES_2015-04-14-CD150414.u17.p3.s4.w53 #ParlaMint-ES_2015-04-14-CD150414.u17.p3.s4.w59"
ERROR ParlaMint-ES_2015-04-30-CD150430.ana: ERROR: Can't find local id for link/@target="#ParlaMint-ES_2015-04-30-CD150430.u24.p2.s6.w87 #ParlaMint-ES_2015-04-30-CD150430.u24.p2.s6.w85"
ERROR ParlaMint-ES_2015-09-16-CD150916.ana: ERROR: Can't find local id for link/@target="#ParlaMint-ES_2015-09-16-CD150916.u69.p3.s1.w28 #ParlaMint-ES_2015-09-16-CD150916.u69.p3.s1.w30"
ERROR ParlaMint-ES_2016-12-20-CD161220.ana: ERROR: Can't find local id for link/@target="#ParlaMint-ES_2016-12-20-CD161220.u3.p6.s4.w74 #ParlaMint-ES_2016-12-20-CD161220.u3.p6.s4.w68"
ERROR ParlaMint-ES_2017-11-14-CD171114.ana: ERROR: Can't find local id for link/@target="#ParlaMint-ES_2017-11-14-CD171114.u77.p7.s5.w24 #ParlaMint-ES_2017-11-14-CD171114.u77.p7.s5.w30"
/project/corpora/Parla/ParlaMint/ParlaMint/Corpora/Master/ParlaMint-ES.TEI.ana/2018/ParlaMint-ES_2018-05-31-CD180531.ana.xml:149410:190: error: element "pc" not allowed here; expected text or element "w"
/project/corpora/Parla/ParlaMint/ParlaMint/Corpora/Master/ParlaMint-ES.TEI.ana/2018/ParlaMint-ES_2018-05-31-CD180531.ana.xml:149410:190: error: character content of element "pc" invalid; must be a string matching the regular expression "\S+"
ERROR ParlaMint-ES_2021-02-24-CD210224.ana: ERROR: Can't find local id for link/@target="#ParlaMint-ES_2021-02-24-CD210224.u28.p10.s3.w32 #ParlaMint-ES_2021-02-24-CD210224.u28.p10.s3.w34"
ERROR ParlaMint-ES_2021-02-24-CD210224.ana: ERROR: Can't find local id for link/@target="#ParlaMint-ES_2021-02-24-CD210224.u28.p10.s3.w34 #ParlaMint-ES_2021-02-24-CD210224.u28.p10.s3.w35"
/project/corpora/Parla/ParlaMint/ParlaMint/Corpora/Master/ParlaMint-ES.TEI.ana/2021/ParlaMint-ES_2021-03-17-CD210317.ana.xml:70496:184: error: element "pc" not allowed here; expected text or element "w"
/project/corpora/Parla/ParlaMint/ParlaMint/Corpora/Master/ParlaMint-ES.TEI.ana/2021/ParlaMint-ES_2021-03-17-CD210317.ana.xml:70496:184: error: character content of element "pc" invalid; must be a string matching the regular expression "\S+"
ERROR: syntactic head #ParlaMint-ES_2018-06-27-CD180627.u261.p4.s1.w52 not found for id ParlaMint-ES_2018-06-27-CD180627.u261.p4.s1.w53
ERROR: syntactic head #ParlaMint-ES_2018-06-27-CD180627.u261.p4.s1.w52 not found for id ParlaMint-ES_2018-06-27-CD180627.u261.p4.s1.w56
ERROR: syntactic head #ParlaMint-ES_2021-02-24-CD210224.u28.p10.s3.w34 not found for id ParlaMint-ES_2021-02-24-CD210224.u28.p10.s3.w35
matyaskopp commented 1 year ago

All of this errors seem to be related to #51

but a few <pc> are preserved inside <w>, not sure why:

ParlaMint-ES.TEI.ana/2018/ParlaMint-ES_2018-05-31-CD180531.ana.xml:149410:190: error: element "pc" not allowed here; expected text or element "w"