openai / evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

Add run_id to final_report from LocalRecorder #1452

Closed: ianmckenzie-oai closed this 5 months ago

ianmckenzie-oai commented 6 months ago

(Not an eval)

The final_report log line currently doesn't include the run_id, so to work out which run a final_report belongs to, we have to look at the run_id of other log lines in the same file. This makes the logs a bit harder to work with, and as far as I can tell there's no downside to including the run_id.
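
To illustrate the difference, here is a rough sketch rather than the actual LocalRecorder code. It assumes the log is a JSONL file in which ordinary lines carry a run_id key and the final report is stored under a final_report key. The helper names make_final_report_line and find_run_id are hypothetical and exist only for this example.

```python
import json
from typing import List, Optional


def make_final_report_line(run_id: str, final_report: dict, include_run_id: bool = True) -> str:
    """Serialize a final_report log line (hypothetical helper, not the evals API)."""
    record = {"final_report": final_report}
    if include_run_id:
        # Proposed behaviour: the line identifies its own run.
        record = {"run_id": run_id, **record}
    return json.dumps(record)


def find_run_id(log_lines: List[str]) -> Optional[str]:
    """Return the run_id that the final_report in these lines belongs to."""
    fallback = None
    found_report = False
    for line in log_lines:
        record = json.loads(line)
        if "final_report" in record:
            if "run_id" in record:
                # With run_id included, the answer is on the line itself.
                return record["run_id"]
            found_report = True
        elif fallback is None and "run_id" in record:
            # Without it, we have to borrow the run_id from another line.
            fallback = record["run_id"]
    return fallback if found_report else None


# A sample event line; the field names other than run_id are made up.
event = json.dumps({"run_id": "run-a", "type": "match", "data": {"correct": True}})

old_style = [event, make_final_report_line("run-a", {"accuracy": 1.0}, include_run_id=False)]
new_style = [event, make_final_report_line("run-a", {"accuracy": 1.0})]

print(find_run_id(old_style))  # relies on the neighbouring event line
print(find_run_id(new_style))  # read directly from the final_report line
```

With the run_id on the final_report line, a consumer can filter for final reports alone (for example when grepping across many log files) without also parsing the surrounding lines.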