mlcommons / modelbench

Run safety benchmarks against AI models and view detailed reports showing how well they performed.
https://mlcommons.org/ai-safety/
Apache License 2.0
62 stars 11 forks source link

Add proper anthropic SUT #675

Open wpietri opened 1 week ago