Open matyaskopp opened 1 year ago
To identify member of Parliament, see
thanks @calzada
To identify member of Parliament, see .+?.
Now I can see it, but there is no affiliation timespan. Are there changes in government during government periods? https://github.com/matyaskopp/ParlaMint/blob/e48f74e3c66adb5a32b8d1051be3d2ebb58c097c/Data/ParlaMint-ES/ParlaMint-ES-listOrg.xml#L47-L56
<listEvent>
<event from="2011-12-21" to="2018-06-01" xml:id="GOV.6">
<label xml:lang="es">Séptimo Gobierno de España (21.12.2011 - 02.06-2018)</label>
<label xml:lang="en">7th Government of Spain (21.12.2011 - 02.06-2018)</label>
</event>
<event from="2018-06-02" xml:id="GOV.7">
<label xml:lang="es">Octavo Gobierno de España (02.06.2018-)</label>
<label xml:lang="en">8th Government of Spain (02.06.2018-)</label>
</event>
</listEvent>
Or can the minister be affiliated for the whole period?
Is the list of ministers complete? ( = Did every minister have a speech in parliament?)
Let me just have a look this afternoon / evening. Best for now. mc
El mié, 21 jun 2023 a las 11:40, Matyáš Kopp @.***>) escribió:
thanks @calzada https://github.com/calzada
To identify member of Parliament, see .+?.
Now I can see it, but there is no affiliation timespan. Are there changes in government during government periods?
<listEvent> <event from="2011-12-21" to="2018-06-01" xml:id="GOV.6"> <label xml:lang="es">Séptimo Gobierno de España (21.12.2011 - 02.06-2018)</label> <label xml:lang="en">7th Government of Spain (21.12.2011 - 02.06-2018)</label> </event> <event from="2018-06-02" xml:id="GOV.7"> <label xml:lang="es">Octavo Gobierno de España (02.06.2018-)</label> <label xml:lang="en">8th Government of Spain (02.06.2018-)</label> </event> </listEvent>
Or can the minister be affiliated for the whole period?
Is the list of ministers complete? ( = Did every minister have a speech in parliament?)
— Reply to this email directly, view it on GitHub https://github.com/clarin-eric/ParlaMint/issues/696#issuecomment-1600521089, or unsubscribe https://github.com/notifications/unsubscribe-auth/AA2ARERKOOBQSQFKLGG45ZTXMK6RXANCNFSM6AAAAAAZNIGNO4 . You are receiving this because you were mentioned.Message ID: @.***>
I have taken a more detailed look into the content of <post>
element.
<speaker>
<name>Pastor Julián, Ana María</name>
<birth_date>19571111</birth_date>
<birth_place country="ES">Cubillos</birth_place>
<status>NA</status>
<gender>female</gender>
<institution>
<ni country="ES">CD</ni>
</institution>
<constituency country="ES" region="Madrid"/>
<affiliation>
<national_party>PP</national_party>
<cd group="GP"/>
</affiliation>
<post> VICEPRESIDENTA</post>
</speaker>
affiliations can be represented this way:
<affiliation ref="#CD" role="member" from="2015-01-21" to="2023-02-22"/> <!-- first and last seen in parliament -->
<affiliation ref="#CD" role="deputyHead"/> <!-- first and last seen in parliament (in this role) should be added/ or do we have a better source for this? -->
<!-- and also parliamentaryGroup and optionally party should be added: -->
<affiliation role="member" ref="#group.GP"/>
<affiliation role="member" ref="#party.PP"/>
<post>VICEPRESIDENTA PRIMERA DEL GOBIERNO, MINISTRA DE LA PRESIDENCIA, RELACIONES CON LAS CORTES Y MEMORIA DEMOCRÁTICA</post>
should become (and again, an issue with unknown dates)
<affiliation ref="#GOV" role="member"/>
<affiliation ref="#GOV" role="deputyHead">
<roleName>VICEPRESIDENTA PRIMERA DEL GOBIERNO</roleName>
</affiliation>
<affiliation ref="#GOV" role="minister">
<roleName>MINISTRA DE LA PRESIDENCIA, RELACIONES CON LAS CORTES Y MEMORIA DEMOCRÁTICA</roleName>
</affiliation>
Is there anything I have to do?
Gathering minister information from Wikipedia can be done with a script (I hope). @charlicruz or @matyaskopp can do it.
Another issue is to decide how to handle parliamentary groups and their possible relation with political parties. Is this information reachable?
This is needed, for complex solution:
or we can do it easily (with a small lie - most of ParlaMinters do it):
politicalParty
role to parliamentaryGroup
role@calzada, are you ok with the 2nd option?
politicalParty
role to parliamentaryGroup
role can be done without any need to gather additional information. This can be done by me.Dear Matyas, Please do whatever seems easier for you. In fact, changing political party to parliamentary group has some sense. So if this is easier for you, please do it. After we finish this update, I need to talk to you about Sketch engine. It does not select parliamentary party/group properly. But this is for later. Let me know if there is still anything I can do. Best mc
El lun, 26 jun 2023 a las 7:45, Matyáš Kopp @.***>) escribió:
Is there anything I have to do?
Gathering minister information from Wikipedia can be done with a script (I hope). @charlicruz https://github.com/charlicruz or @matyaskopp https://github.com/matyaskopp can do it.
Another issue is to decide how to handle parliamentary groups and their possible relation with political parties. Is this information reachable?
This is needed, for complex solution:
- parliamentary group full names - in transcriptions there are only abbreviated ones
- party-group representation relation timespan
- finally coalition/opposition should show relation among parliamentary groups
or we can do it easily (with a small lie - most of ParlaMinters do it):
- change politicalParty role to parliamentaryGroup role
@calzada https://github.com/calzada, are you ok with the 2nd option?
- I personally prefer 2nd option because I am not sure if @charlicruz https://github.com/charlicruz is with us and changing politicalParty role to parliamentaryGroup role can be done without any need to gather additional information. This can be done by me.
— Reply to this email directly, view it on GitHub https://github.com/clarin-eric/ParlaMint/issues/696#issuecomment-1606691868, or unsubscribe https://github.com/notifications/unsubscribe-auth/AA2AREXO6FQXIYUPI4XJGJTXNEOYNANCNFSM6AAAAAAZNIGNO4 . You are receiving this because you were mentioned.Message ID: @.***>
I can modify the politicalParty role to parliamentaryGroup for all xml files. I have uploaded the CD150120.xml example. Again, I have problems with commit and push under GitHub desktop as I have no permisssion and I uploaded directly by webpage. If it is correct, we do it for the rest, what do you think?
Another issue is there are so many
Matyas, will you be available in July?
I can modify the politicalParty role to parliamentaryGroup for all xml files.
This is already done:
I have uploaded the CD150120.xml example. Again, I have problems with commit and push under GitHub desktop as I have no permisssion and I uploaded directly by webpage. If it is correct, we do it for the rest, what do you think?
I don't know what should I think, you are modifying source CD
format https://github.com/charlicruz/PARLAMINT-ES-MC/commit/09457fddb1c93e4067f78ddfde7f456c956b8f85 which become invalid according to https://github.com/charlicruz/PARLAMINT-ES-MC/blob/master/CD/cd.dtd
You have to discuss these changes with @calzada first.
I believe the best solution is to leave CD
format as it is and just modify the conversion script , but you need to be up to date with my fork, because I made a lot of changes in https://github.com/matyaskopp/PARLAMINT-ES-MC/blob/master/bin/cd2parmamint.xsl
Another issue is there are so many
UNKNOWN and I don't know how to modify it step by step I expected to have a small xml sample working but I have some problems after make compilation
the UNKNOWN
party can be preserved, as you can see, conversion does not propagate it into TEI file:
https://github.com/matyaskopp/ParlaMint/blob/a10afc44515fc57d0d46196157c0d4f8d3939afb/Data/ParlaMint-ES/ParlaMint-ES-listPerson.xml
Matyas, will you be available in July?
more or less yes
Now I am implementing a script for gathering government members from wikipedia and then integrating affiliations in <listPerson>
(I hope it will not take much time - tomorrow it should be ready)
@TomazErjavec I am close to finishing all necessary scripts for producing the ParlaMint-ES corpus. Can you please take a look at the sample https://github.com/clarin-eric/ParlaMint/pull/692? If there is nothing serious before I start processing the whole corpus.
Very nice indeed! I didn't do a formal validation, as you have probably done that but I noticed a few minor things:
- for handles you could use http://hdl.handle.net/11356/1859 (TEI) and http://hdl.handle.net/11356/1860 (ana) (but finalize script inserts that anyway)
I am aware of that. I will preserve by wrong handle http://hdl.handle.net/11356/XXXX
, it is safer to have totally wrong handle, instead of pointing to some existing, but wrong handle
- utterances often have transcriber comments at the end, and, strictly speaking, they should go outside, i.e. just after the utterance; but in practice it doesn't much matter
But there will be utterances without segments or notes. We do not allow it. I have discovered a several utterances of this type: source https://www.congreso.es/public_oficiales/L14/CONG/DS/PL/DSCD-14-PL-75.PDF
<intervention id='in78'>
<speaker>
<name>Bassa Coll, Montserrat</name>
<birth_date>19650420</birth_date>
<birth_place country="ES">UNKNOWN</birth_place>
<status>NA</status>
<gender>female</gender>
<institution>
<ni country="ES">CD</ni>
</institution>
<constituency country="ES" region="Girona"/>
<affiliation>
<national_party>ERC-S</national_party>
<cd group="GR"/>
</affiliation>
<post>NA</post>
</speaker>
<speech id='sp78' language="ES">
<omit type="comment">Termina su intervención en catalán.-Aplausos</omit>.
</speech>
</intervention>
result:
<u xml:id="ParlaMint-ES_2021-02-02-CD210202.u78"
who="#MontserratBassaColl"
ana="#regular">
<vocal type="clarification">
<desc>Termina su intervención en catalán.-Aplausos</desc>
</vocal>
</u>
- more of an aesthetic issue: you have IDs like "ParlaMint-ES_2023-02-23-CD230223.u1.1.s1.w1", it would be more consistent to have "ParlaMint-ES_2023-02-23-CD230223.u1.seg1.s1.w1
Good point, I will implement it, but I will use p
prefix instead of seg
(to be consistent with UA and CZ :-))
OK, good arguments for ignoring first two suggestions, and, yes, p prefix is then indeed better for the third. Good luck!
@charlicruz, @calzada
Improve note annotations
eg:
should be
most common notes with frequencies:
Missing who when chair
Missing
who
attribute https://github.com/matyaskopp/PARLAMINT-ES-MC/blob/4dc6c5f53597e2bdc3b3925a4424cb38764a4931/ParlaMint.sample/ParlaMint-ES_2015-01-20-CD150120.xml#L100-L103source: https://github.com/matyaskopp/PARLAMINT-ES-MC/blob/4dc6c5f53597e2bdc3b3925a4424cb38764a4931/CD.sample/CD150120.xml#L57-L76
chairman name is present in source file: https://github.com/matyaskopp/PARLAMINT-ES-MC/blob/4dc6c5f53597e2bdc3b3925a4424cb38764a4931/CD.sample/CD150120.xml#L52
list of chairmans with frequencies:
No guest speakers ???
This is a bit strange. In ES parliament, there is no speaker labelled with
guest
category (ana="#guest"
)Missing parliamentaryGroups
parliamentaryGroup
it seems that source data contain parliamentary groups. They are now required (https://clarin-eric.github.io/ParlaMint/#sec-parties) in ParlaMint (parties can be converted into groups or better, encode both party+groups)
list of parliamentary groups with number of affiliated persons
Parliamentary group - party pairs:
Missing translation
https://github.com/matyaskopp/ParlaMint/blob/e48f74e3c66adb5a32b8d1051be3d2ebb58c097c/Data/ParlaMint-ES/ParlaMint-taxonomy-parla.legislature.xml#L200-L207
parliamentaryGroup affiliation overlaps
I have discovered this accidentally because it produces a different error:
for this error, there can be many reasons:
MartínezMaría
migrates between parties a lot (embodied multiple times to a single party) - the script is not able to deal with it