issues
search
carlini
/
yet-another-applied-llm-benchmark
A benchmark to evaluate language models on questions I've previously asked them to solve.
GNU General Public License v3.0
875
stars
64
forks
source link
Update evaluator.py
#1
Closed
Evanc123
closed
7 months ago
Evanc123
commented
7 months ago
fix typo
fix typo