mlcommons / modelbench

Run safety benchmarks against AI models and view detailed reports showing how well they performed.
https://mlcommons.org/ai-safety/
Apache License 2.0

Stand up Mistral SUT adapter for Ministral 8B and Mistral Large #688

Open rogthefrog opened 5 hours ago

rogthefrog commented 5 hours ago

Credentials are in Keeper.

To test: send too many requests, see how it fails, and add retry logic.

Look at the Together SUT adapter for existing retry logic.
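
For reference, here is a rough sketch of the kind of retry wrapper this could end up as. It is not the Together adapter's actual logic and not the Mistral client's API: `RateLimitError` and `call_with_retry` are hypothetical names, and the real exception type and status code still need to be confirmed by sending too many requests and seeing how it fails. The sketch assumes the failure surfaces as an exception and retries with exponential backoff plus a little jitter.

```python
import random
import time


class RateLimitError(Exception):
    """Hypothetical stand-in for whatever the client raises when rate limited."""


def call_with_retry(fn, max_attempts=5, base_delay=1.0, max_delay=30.0, sleep=time.sleep):
    """Call fn(), retrying on RateLimitError with exponential backoff.

    The delay doubles after each failed attempt (base_delay, 2x, 4x, ...),
    is capped at max_delay, and gets up to 10% random jitter added so that
    parallel workers do not retry in lockstep. The final failure is re-raised.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return fn()
        except RateLimitError:
            if attempt == max_attempts:
                raise  # out of attempts: surface the error to the caller
            delay = min(max_delay, base_delay * 2 ** (attempt - 1))
            sleep(delay + random.uniform(0, delay / 10))
```

The `sleep` parameter is injectable only so the backoff can be tested without actually waiting. In the adapter, `fn` would wrap the single request to the model, and the `except` clause would name whatever error the test above turns up.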

rogthefrog commented 5 hours ago

See https://github.com/mlcommons/modelbench/issues/607