mlcommons / modelbench

Run safety benchmarks against AI models and view detailed reports showing how well they performed.
https://mlcommons.org/ai-safety/
Apache License 2.0

Add persona and persona*hazard breakdown to benchmark grading functions #666

Open: rogthefrog opened 3 weeks ago