Add proper anthropic SUT

mlcommons / modelbench

Run safety benchmarks against AI models and view detailed reports showing how well they performed.

https://mlcommons.org/ai-safety/

Apache License 2.0

62 stars 11 forks source link

Open wpietri opened 1 week ago