allenai / unifiedqa

UnifiedQA: Crossing Format Boundaries With a Single QA System
https://arxiv.org/abs/2005.00700
Apache License 2.0
428 stars 43 forks source link

Evaluation scores #21

Closed NacirB closed 3 years ago

NacirB commented 3 years ago

Hey,

Is it possible for you to push the evaluation code used to calculate scores for each dataset ? ( to reproduce scores in the last table )

danyaljj commented 3 years ago

The evaluation scripts are in the repo now.