swerik-project / riksdagen-records

0 stars 1 forks source link

Non-speech text in `<seg>` and `<u>` elems #14

Open BobBorges opened 2 months ago

BobBorges commented 2 months ago

I noticed working with interpellation questions that pre199495, questions submitted in writing that are included in protocols are often annotated with seg and u elements, though they are not representing speech events. It's probably part of a larger issue, not only related to these debates:

BobBorges commented 1 month ago

https://github.com/welfare-state-analytics/riksdagen-corpus/issues/296