openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Other
14.36k stars 2.55k forks source link

Updates for Solvers #1461

Closed JunShern closed 5 months ago

JunShern commented 5 months ago

We provide an update to our Solvers infrastructure