Closed by MansMeg 3 years ago
Are these usually included as margin notes? Parla-CLARIN example: https://github.com/welfare-state-analytics/riksdagen-corpus/blob/dev/data/new-parlaclarin/prot-198081--43.xml#L1042 Associated image scan: https://betalab.kb.se/prot-198081--43/prot_198081__43-010.jp2/_view
EDIT: fix links
I'm not sure that they are always included as margin notes, but in some debates they are.
Then I'm wondering if there's a heuristic for finding them. Maybe a number together with the symbol §. Otherwise more annotation is needed.
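A rough sketch of what such a heuristic could look like, in case it helps the discussion. It assumes a title line starts with a number, followed by § and then the title text; that ordering is my guess and has not been checked against the protocols. The names `TITLE_RE` and `find_title_candidates` are made up for this example and are not part of the corpus code.

```python
import re

# Assumption: a debate title line looks like "<number> § <title text>".
# The actual layout in the protocols may differ (for example "§ <number>").
TITLE_RE = re.compile(r"^\s*(\d+)\s*§\s*(\S.*)$")


def find_title_candidates(lines):
    """Return (line_index, section_number, title) for lines matching the heuristic."""
    hits = []
    for i, line in enumerate(lines):
        m = TITLE_RE.match(line)
        if m:
            hits.append((i, int(m.group(1)), m.group(2).strip()))
    return hits
```

Lines where § only appears mid-sentence are skipped, so this would only give candidates to review by hand, not a replacement for annotation.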
Yes. I think we would need annotation for this. Although for debates from 1993 onwards they should be included?
Duplicate of #15
Each debate has a "title". This is relevant metadata to add in the corpus per debate.