lgabs / gpt-resolve

Can GPT solve Brazilian university entrance exams?
MIT License
17 stars 2 forks source link

Automate Model Solution Evaluation Against Official Solutions #12

Open lgabs opened 1 week ago

lgabs commented 1 week ago

Automate the evaluation process of model-generated solutions by comparing them against official solutions. This would streamline the validation and accuracy assessment. Yet, final human review would be still be necessary to avoid mistakes, but would be a much more faster phase for reviewers.

An simple and efficient process would be: