Label aggregation error?

kanishk-adapt / semeval-task10

Repo for SemEval Task #10 EDOS 2023. created and maintained for DCU - ADAPT submissions

Other

0 stars 0 forks source link

@jowagner I found a total of 946 cases. Of which, 'train': 697, 'test': 174, 'dev': 75. See .csv attached.

Columns denote, ind_sexist_count -> number of times labelled as 'sexist';
ind_not-sexist_count -> number of times labelled as 'not sexist'; agg_label -> aggregated label

problematic_cases_946.csv

The task organisers state the following in their paper, Three annotators labelled each entry. To further ensure label quality, we rely on expert adjudication for disagreements. Experts were called upon to give labels for (i) cases with less than 3/3 agreement (unanimous) in Task A, and (ii) cases with less than 2/3 agreement in Tasks B and C. https://arxiv.org/pdf/2303.04222.pdf (3.4 Annotator Process)

I do not know how to pose the question for

``what was the reasoning of expert annotators'', ?
``were the decisions noted as comments" ?

kanishk-adapt / semeval-task10

Label aggregation error? #40