carlini / yet-another-applied-llm-benchmark

A benchmark to evaluate language models on questions I've previously asked them to solve.
GNU General Public License v3.0
875 stars 64 forks source link

Update evaluator.py #1

Closed Evanc123 closed 7 months ago

Evanc123 commented 7 months ago

fix typo