clarin-eric / ParlaMint

ParlaMint: Comparable Parliamentary Corpora
https://clarin-eric.github.io/ParlaMint/
41 stars 52 forks source link

BG feedback #549

Open matyaskopp opened 1 year ago

matyaskopp commented 1 year ago

Nice work, thanks. I have a few observations and comments:

corpus timespan

subtitle:

<title type="sub" xml:lang="bg">Стенограми ... България (2014-10-27 - 2022-07-29)</title>
<title type="sub" xml:lang="en">Minutes... Bulgaria (2014-10-27 - 2022-07-29)</title>

bibl:

<bibl>
  ...
  <date from="2014-10-27" to="2022-07-29">2014-10-27 - 2022-07-29</date>
</bibl>

setting has a different to date

<setting>
  ...
  <date from="2014-10-27" to="2020-07-31"></date>
</setting>

minister encoding

Every minister should be affiliated two minister-role and member-role. eg:

<person xml:id="ZhekovaSvetlana">
  <persName>
    <forename>Светлана</forename>
    <surname>Жекова</surname>
  </persName>
  <sex value="F"></sex>
  <birth when="1967-02-01">
    <placeName>София</placeName>
  </birth>
  <education>Магистър по Френска филология в СУ „Св. Климент Охридски“</education>
  <occupation>политик</occupation>
  <!-- minister affiliation: -->
  <affiliation from="2014-08-06" ref="#gov.BGGov" role="minister" to="2014-11-07">
    <roleName xml:lang="bg">Министър на околната среда и водите</roleName>
  </affiliation>
  <!-- a member of government affiliation: -->
  <affiliation from="2014-08-06" ref="#gov.BGGov" role="member" to="2014-11-07"/>
  <idno type="wikimedia" xml:lang="bg">https://bg.wikipedia.org/wiki/Светлана_Жекова</idno>
</person>

one component corresponds to a sitting

https://github.com/ivo-clark/ParlaMint/blob/7525b9dae33da2172e53c871836382e099ae3f12/Data/ParlaMint-BG/ParlaMint-BG_2014-10-29.xml#L2

<TEI ana="#reference" xml:id="ParlaMint-BG_2014-10-29" xml:lang="bg" xmlns="http://www.tei-c.org/ns/1.0">

should be

<TEI ana="#reference #parla.sitting" xml:id="ParlaMint-BG_2014-10-29" xml:lang="bg" xmlns="http://www.tei-c.org/ns/1.0">

<meeting> elements list should contain sitting. You can use the date if not any reasonable number:

<meeting ana="#parla.sitting" corresp="#BGNS" n="2014-10-29">2014-10-29</meeting>
<meeting ana="#parla.meeting.regular" corresp="#BGNS" n="2">2 пленарно заседание</meeting>
<meeting ana="#parla.session" corresp="#BGNS" n="1">1 парламентарна сесия</meeting>
<meeting ana="#parla.term" corresp="#BGNS" n="43">Четиридесет и трето народно събрание</meeting>

link to video

It would be nice to include a link to a video of the sitting: https://parliament.bg/bg/plenaryst/ns/51/ID/5306

link to the exact source

https://github.com/ivo-clark/ParlaMint/blob/7525b9dae33da2172e53c871836382e099ae3f12/Data/ParlaMint-BG/ParlaMint-BG_2014-10-29.xml#L37

<idno type="URI">https://parliament.bg/bg/plenaryst</idno>

Should be:

<idno type="URI" subtype="parliament">https://parliament.bg/bg/plenaryst/ns/51/ID/5306</idno>

It would be nice to include a link to a video of the sitting: https://parliament.bg/bg/plenaryst/ns/51/ID/5306

missing notes

you encode this https://github.com/ivo-clark/ParlaMint/blob/7525b9dae33da2172e53c871836382e099ae3f12/Data/ParlaMint-BG/ParlaMint-BG_2014-10-29.xml#L49-L50

<div type="debateSection">
  <u ana="#chair" n="93" who="#TsachevaTsetska" xml:id="ParlaMint-BG_2014-10-29.u1">...</u>

but data also contains notes: image

proceeding of sitting is not published before the day of sitting

the day of sitting or later is expected https://github.com/ivo-clark/ParlaMint/blob/7525b9dae33da2172e53c871836382e099ae3f12/Data/ParlaMint-BG/ParlaMint-BG_2022-06-29.xml#L61

<bibl>
  <title type="main">Minutes of the National Assembly of the Republic of Bulgaria</title>
  <idno type="URI">https://parliament.bg/bg/plenaryst</idno>
  <date when="2020-03-11"></date>
</bibl>