allenai / WildBench

Benchmarking LLMs with Challenging Tasks from Real Users
https://huggingface.co/spaces/allenai/WildBench
Apache License 2.0
194 stars 36 forks source link

add gemini-1.5 #11

Closed da03 closed 5 months ago