welfare-state-analytics / riksdagen-corpus

Swedish parliamentary proceedings - Riksdagens protokoll 1867-today
Other
26 stars 5 forks source link

Annotate train set for segment classification, part 2 #465

Closed ninpnin closed 4 months ago

ninpnin commented 5 months ago

Annotate sample-p-1-seed-2024-02-02-part-2.csv according to the "Segment classification of protocols" guidelines laid out in the annotation document. Fill out the correct paragraph type ("u", "note", "intro", "margin" or "title") in the "segmentation" column. The (beginning of the) text of the paragraph is available in the "text" column, and a link to the scanned PDF in the "facs" column.

Related to #189 and #462.

viremn commented 4 months ago

sample-p-1-seed-2024-02-02-part-2_partial_annotation_for_inter_annotator_agreement.csv

Here are 30 something lines for inter-annotator agreement evaluation, as requested @ninpnin

DanloeCHAN commented 4 months ago

sample-p-1-seed-2024-02-02-part-2_anotate.csv Here are the completed one @ninpnin

ninpnin commented 4 months ago

@DanloeCHAN What file format is that? Please try to export as .csv

ninpnin commented 4 months ago
Screenshot 2024-02-22 at 09 48 28 Screenshot 2024-02-22 at 09 47 20 Screenshot 2024-02-22 at 09 47 50

For the record, this is what I get.

DanloeCHAN commented 4 months ago

Im sending the message to you through Slack. Wait me for a moment @ninpnin

DanloeCHAN commented 4 months ago

hej, I use Vscode to fill in the values and until now it is 300 lines finished (will continue to do so if this works) @ninpnin sample-p-1-seed-2024-02-02-part-2_300lines.csv

ninpnin commented 4 months ago

@DanloeCHAN what's the status on this?

DanloeCHAN commented 4 months ago

@ninpnin yes, here is the finished file. please let me know whether it works on your computer. sample-p-1-seed-2024-02-02-part-2-annotated.csv