welfare-state-analytics / riksdagen-corpus

Swedish parliamentary proceedings - Riksdagens protokoll 1867-today
Other
26 stars 5 forks source link

Add debate name/purpose as metadata #11

Closed MansMeg closed 3 years ago

MansMeg commented 3 years ago

Each debate has a "title". This is relevant metadata to add in the corpus per debate.

ninpnin commented 3 years ago

Are these usually included as margin notes? Parla Clarin example https://github.com/welfare-state-analytics/riksdagen-corpus/blob/dev/data/new-parlaclarin/prot-198081--43.xml#L1042 Associated image scan https://betalab.kb.se/prot-198081--43/prot_198081__43-010.jp2/_view

EDIT: fix links

MansMeg commented 3 years ago

Im not sure that they are always included as margin notes, but in some debates they are.

ninpnin commented 3 years ago

Then I'm wondering if there's a heuristic for finding them. Maybe a number and this symbol § . Otherwise more annotation is needed.

MansMeg commented 3 years ago

Yes. I think we would need annotation fir this. Although for debates 1993- they should be included?

ninpnin commented 3 years ago

Duplucate of #15