welfare-state-analytics / riksdagen-corpus

Swedish parliamentary proceedings - Riksdagens protokoll 1867-today

Annotate train set for segment classification, part 1 #464

Closed · ninpnin closed 6 months ago

ninpnin commented 8 months ago

Annotate sample-p-1-seed-2024-02-02-part-1.csv according to the "Segment classification of protocols" guidelines laid out in the annotation document. Fill in the correct paragraph type ("u", "note", "intro", "margin" or "title") in the "segmentation" column. The beginning of each paragraph's text is available in the "text" column, and a link to the scanned PDF is in the "facs" column.
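For anyone annotating, a quick sanity check of the finished file can catch typos in the labels before uploading. The following is only a minimal sketch, not part of the project's tooling: the function name `find_invalid_rows` is hypothetical, and it assumes the CSV is comma-separated with a header row containing the "segmentation" column named above.

```python
import csv
import io

# The five paragraph types listed in the task description.
ALLOWED_LABELS = {"u", "note", "intro", "margin", "title"}


def find_invalid_rows(csv_file):
    """Return (row_number, label) pairs whose label is missing or not allowed.

    `csv_file` is any open text file or file-like object. This helper is
    hypothetical and only illustrates the check.
    """
    reader = csv.DictReader(csv_file)
    problems = []
    # start=2 so the number matches a spreadsheet row (the header is row 1).
    for row_number, row in enumerate(reader, start=2):
        label = (row.get("segmentation") or "").strip()
        if label not in ALLOWED_LABELS:
            problems.append((row_number, label))
    return problems


if __name__ == "__main__":
    # Made-up sample rows; real files would be opened with open(path, newline="").
    sample = (
        "text,facs,segmentation\n"
        "Herr talman!,https://example.org/1,u\n"
        "Nr 12,https://example.org/2,heading\n"
    )
    for row_number, label in find_invalid_rows(io.StringIO(sample)):
        print(f"row {row_number}: unexpected label {label!r}")
```

An empty result means every row carries one of the five allowed labels; it says nothing about whether the label is the right one for that paragraph.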

Related to #189 and #462.

ninpnin commented 7 months ago

@DanloeCHAN any updates on this?

DanloeCHAN commented 7 months ago

@ninpnin Hi. I am still working on it. I also need a username and password to open the betalab.kb.se links. Which ones should I use? My GitHub email and my student email don't work. Also, because the text is in Swedish, labeling takes me longer (I can only read English, so I have to translate it first). Sorry for the inconvenience. Looking forward to your reply.

ninpnin commented 7 months ago

@DanloeCHAN Hi! Fredrik's email from 2024-01-19 explains how you log in to betalab.

DanloeCHAN commented 7 months ago

@ninpnin Thank you so much! I found it. It was in a different email inbox.

ninpnin commented 7 months ago

@DanloeCHAN Also, please reach out to me (Väinö) on Slack if anything is unclear.

DanloeCHAN commented 7 months ago

@ninpnin Yes, I will! The thing is that I translate the paragraphs into English first, so the process is slow for me. But I will reach out to you if I run into any issues. Thank you so much!

viremn commented 7 months ago

sample-p-1-seed-2024-02-02-part-1_partial_annotation_for_inter_annotator_agreement.csv

Here are 30-something lines for inter-annotator agreement evaluation, as requested, @ninpnin.
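The thread does not say which agreement measure will be used. As an illustration only, the sketch below computes Cohen's kappa, a common choice for two annotators, on two equally long lists of labels. The function name `cohens_kappa` and the example labels are assumptions, not taken from the repository.

```python
from collections import Counter


def cohens_kappa(labels_a, labels_b):
    """Cohen's kappa for two annotators who labeled the same items in the same order."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both label lists must be non-empty and equally long")
    n = len(labels_a)
    # Observed agreement: share of items where both annotators chose the same label.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: based on how often each annotator used each label.
    counts_a = Counter(labels_a)
    counts_b = Counter(labels_b)
    expected = sum(counts_a[label] * counts_b[label] for label in counts_a) / (n * n)
    if expected == 1.0:
        # Both annotators used one and the same label throughout.
        return 1.0
    return (observed - expected) / (1 - expected)


if __name__ == "__main__":
    # Made-up labels for four paragraphs, one list per annotator.
    annotator_1 = ["u", "u", "note", "note"]
    annotator_2 = ["u", "u", "note", "u"]
    print(cohens_kappa(annotator_1, annotator_2))
```

Kappa corrects raw agreement for the agreement expected by chance, which matters here because one label (likely "u") may dominate the sample.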

DanloeCHAN commented 6 months ago

@ninpnin Here is the annotated version: sample-p-1-seed-2024-02-02-part-1-annotated.csv