Add persona and persona*hazard breakdown to benchmark grading functions

mlcommons / modelbench

Run safety benchmarks against AI models and view detailed reports showing how well they performed.

https://mlcommons.org/ai-safety/

Apache License 2.0

62 stars 11 forks source link

Open rogthefrog opened 3 weeks ago