calzada / PARLAMINT-ES-MC

2 stars 4 forks source link

no guest speaker #40

Closed matyaskopp closed 1 year ago

matyaskopp commented 1 year ago

Originally reported here: https://github.com/clarin-eric/ParlaMint/issues/696#issue-1765368729

There are no guest speakers in the whole corpus, I have been investigating it and it seems that if the value speaker/ni/text() is different from CD, then u/@ana should be #guest. eg this situation: source https://github.com/calzada/PARLAMINT-ES-MC/blob/0a94780418e87be3b367938c2fae4ccb7dba1b15/CD/CD150512.xml#L174-L199

result https://github.com/calzada/PARLAMINT-ES-MC/blob/0a94780418e87be3b367938c2fae4ccb7dba1b15/ParlaMint-ES.TEI/ParlaMint-ES_2015-05-12-CD150512.xml#L126-L137

am I right?


All values and frequencies:

cat CD/*|grep '<ni '|grep -v '>CD<'|sort|uniq -c|sort -nr
     27 <ni country="ES">NA</ni>
     13 <ni country="ES">Parlament-Catalunya</ni>
     12 <ni country="ES">Les Corts Valencianes</ni>
     12 <ni country="ES">Asamblea Regional de Murcia</ni>
     10 <ni country="ES">Parlamento de les Illes Balears</ni>
      6 <ni country="ES">Parlamento de Navarra</ni>
      5 <ni country="ES">Parlamento de Cantabria</ni>
      5 <ni country="ES">Parlamento de Andalucía</ni>
      5 <ni country="ES">Cortes de Aragón</ni>
      4 <ni country="ES">UNKNONW</ni>
      4 <ni country="ES">Parlamento de Andalucía</ni>
      4 <ni country="ES">Parlament Illes Balears</ni>
      4 <ni country="ES">Junta General del Principado de Asturias</ni>
      4 <ni country="ES">Asamblea de Madrid</ni>
      3 <ni country="ES">Parlamento Vasco</ni>
      3 <ni country="ES">Parlamento Vasco</ni>
      3 <ni country="ES">Parlamento de les Illes Balears</ni>
      3 <ni country="ES">Parlamento de la Rioja</ni>
      3 <ni country="ES">Asamblea Regional de Murcia</ni>
      2 <ni country="ES">Parlamento de Navarra</ni>
      2 <ni country="ES">Parlamento de Canarias</ni>
      2 <ni country="ES">Parlamento de Canarias</ni>
      2 <ni country="ES">Defensora del Pueblo</ni>
      2 <ni country="ES">Cortes de Aragón</ni>
      2 <ni country="ES">Asamble Regional de Murcia</ni>
      1 <ni country="ES">Xunta de Galicia</ni>
      1 <ni country="ES">Xunta de Galicia</ni>
      1 <ni country="ES">UNKNOWN</ni>
      1 <ni country="ES">Parlamento\{Z}de Galicia</ni>
      1 <ni country="ES">Parlamento de Murcia</ni>
      1 <ni country="ES">Parlamento de Galicia</ni>
      1 <ni country="ES">Parlamento de Galicia</ni>
      1 <ni country="ES">Defensor del Pueblo</ni>
      1 <ni country="ES">Defensor del Pueblo</ni>
      1 <ni country="ES">Asamblea de Madrid</ni>
      1 <ni country="ES">Asamblea de la Comunidad de Madrid</ni>
calzada commented 1 year ago

Ok. Yes ni is people from nacional institucional such as regionales parlamento who come to speak as guests. You could also have the case of vip, a very important person who soeaks at the parlamento, such as a King or Bill Gates. Is this useful? Mc

El vie, 4 ago 2023, 9:20, Matyáš Kopp @.***> escribió:

Assigned #40 https://github.com/calzada/PARLAMINT-ES-MC/issues/40 to @calzada https://github.com/calzada.

— Reply to this email directly, view it on GitHub https://github.com/calzada/PARLAMINT-ES-MC/issues/40#event-10006255915, or unsubscribe https://github.com/notifications/unsubscribe-auth/AA2AREQQ2ZZ4FQUFRIZTYM3XTSPFPANCNFSM6AAAAAA3DZWPLA . You are receiving this because you were assigned.Message ID: @.***>

matyaskopp commented 1 year ago

So is this true? if speaker[not(institution/ni/text() = 'CD')] then the speaker is guest.

calzada commented 1 year ago

Please copy and paste here an intervention metadata so that I can doble check I am travelling you see. Also the CD dtd I think has this information as comments. Best

El vie, 4 ago 2023, 15:56, Matyáš Kopp @.***> escribió:

So is this true? if speaker[not(institution/ni/text() = 'CD')] then the speaker is guest.

— Reply to this email directly, view it on GitHub https://github.com/calzada/PARLAMINT-ES-MC/issues/40#issuecomment-1665651665, or unsubscribe https://github.com/notifications/unsubscribe-auth/AA2ARETLDILMCAYPS22T7ZDXTT5QRANCNFSM6AAAAAA3DZWPLA . You are receiving this because you were mentioned.Message ID: @.***>

matyaskopp commented 1 year ago

This is a relevant part of dtd, without comments:

<!ELEMENT speaker (name,birth_date,birth_place,status,gender,institution,constituency,affiliation,post)>

<!ELEMENT institution (ni|io|ngo|vip) >
<!ELEMENT ni (#PCDATA) >
<!ATTLIST ni country (ES|UNKNOWN|NA|NON-ES) #REQUIRED >
<!ELEMENT io (#PCDATA) >
<!ELEMENT ngo (#PCDATA) >
<!ELEMENT vip (#PCDATA) >
calzada commented 1 year ago

Sorry Matyas. I thought we had added comments. I am checking your previos comentarios now. Best

El vie, 4 ago 2023, 21:31, Matyáš Kopp @.***> escribió:

This is a relevant part of dtd, without comments:

<!ELEMENT speaker (name,birth_date,birth_place,status,gender,institution,constituency,affiliation,post)>

<!ELEMENT institution (ni|io|ngo|vip) > <!ELEMENT ni (#PCDATA) > <!ATTLIST ni country (ES|UNKNOWN|NA|NON-ES) #REQUIRED > <!ELEMENT io (#PCDATA) > <!ELEMENT ngo (#PCDATA) > <!ELEMENT vip (#PCDATA) >

— Reply to this email directly, view it on GitHub https://github.com/calzada/PARLAMINT-ES-MC/issues/40#issuecomment-1666079097, or unsubscribe https://github.com/notifications/unsubscribe-auth/AA2AREQ56BMG52L5YV7DJODXTVE2RANCNFSM6AAAAAA3DZWPLA . You are receiving this because you were mentioned.Message ID: @.***>

calzada commented 1 year ago

This is indeed true

El vie, 4 ago 2023, 15:56, Matyáš Kopp @.***> escribió:

So is this true? if speaker[not(institution/ni/text() = 'CD')] then the speaker is guest.

— Reply to this email directly, view it on GitHub https://github.com/calzada/PARLAMINT-ES-MC/issues/40#issuecomment-1665651665, or unsubscribe https://github.com/notifications/unsubscribe-auth/AA2ARETLDILMCAYPS22T7ZDXTT5QRANCNFSM6AAAAAA3DZWPLA . You are receiving this because you were mentioned.Message ID: @.***>