mlcommons / modelgauge

Make it easy to automatically and uniformly measure the behavior of many AI Systems.
https://mlcommons.org/ai-safety/
Apache License 2.0
26 stars 7 forks source link

Report *accurate* num items scored in test result + add num items that *were not* scored #526

Open bkorycki opened 3 months ago