Closed evamutz closed 2 years ago
Recreating Tigray Evaluation datasets might be a more accurate title for this PR
Stale pull request message
Merged latest changes into this branch, let me know if the merge looks okay!
@evamutz Added a quick fix for setting the label columns. Let me know if all good!
@evamutz The one failing test is some duplicates in the data, we can merge this and I'll address the issue later. Let me know if anything is left here
Everything else looks good to me! I don't see way to merge bc of failing test, but if you're able to I think we're all set
updated data for tigray, plus some small code changes (tracking number of disagreed labels in each dataset and deprecating warnings about ill-defined f-scores)