Toloka / crowd-kit

Control the quality of your labeled data with the Python tools you already know.
https://crowd-kit.readthedocs.io/
Other
211 stars 15 forks source link

Ordinal Labels #17

Open vikasraykar opened 2 years ago

vikasraykar commented 2 years ago

Is it possible to support aggregation of ordinal labels as a part of this toolkit via this reduction algorithm.

pilot7747 commented 2 years ago

Hello @vikasraykar!

Thank you for posting this issue. I think it's possible. Do you have any paper in mind that provides some details about this algorithm? I wonder how to set constraints for the aggregation to output Pr[yi > c1] > Pr[yi > c2] in case when c1 < c2. I'm not sure this constraint will be satisfied if we just run, for instance, separate DS aggregations for each of the binary classification tasks.

vikasraykar commented 2 years ago

A Simple Approach to Ordinal Classification Eibe Frank and Mark Hall https://www.cs.waikato.ac.nz/~eibe/pubs/ordinal_tech_report.pdf