bigscience-workshop / evaluation

Code and Data for Evaluation WG
Other
41 stars 24 forks source link

Add WinoMT to Full Benchmark #35

Open epavlick opened 3 years ago

maraimm commented 3 years ago

I can help with this!

tomlimi commented 2 years ago

I am working on a similar WinoBias dataset to evaluate gender bias in coreference resolution. The prompts were already available in promptsource.