issues
search
openai
/
evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Other
14.36k
stars
2.55k
forks
source link
Investigate failing Assistants test
#1479
Closed
JunShern
closed
3 months ago