More elaborate private tests, saner public tests.

mlcommons / modelbench

Run safety benchmarks against AI models and view detailed reports showing how well they performed.

https://mlcommons.org/ai-safety/

Apache License 2.0

62 stars 11 forks source link

Closed wpietri closed 2 weeks ago

wpietri commented 2 weeks ago

The public version of this is mainly cleanup; the private version adds more tests.

github-actions[bot] commented 2 weeks ago

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅