mlcommons / modelbench

Run safety benchmarks against AI models and view detailed reports showing how well they performed.
https://mlcommons.org/ai-safety/
Apache License 2.0
62 stars 11 forks source link

More elaborate private tests, saner public tests. #679

Closed wpietri closed 2 weeks ago

wpietri commented 2 weeks ago

The public version of this is mainly cleanup; the private version adds more tests.

github-actions[bot] commented 2 weeks ago

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅