braintrustdata / autoevals

AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
MIT License
199 stars 17 forks source link

Try gpt-4o-mini #78

Open ankrgyl opened 3 months ago

github-actions[bot] commented 3 months ago

Braintrust eval report

Autoevals (gpt-4o-mini-1721329696) Score Average Improvements Regressions
NumericDiff 73.5% (-2pp) 25 🟢 22 🔴
Duration 4.38s - -
Estimated_cost 0$ - -