openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
14.76k stars, 2.58k forks

What is this #1527

Open: DXv-3 opened this issue 5 months ago

GearUnclear commented 4 months ago

Great discussion you started here. Did you read the readme yet?