issues
search
allenai
/
WildBench
Benchmarking LLMs with Challenging Tasks from Real Users
https://huggingface.co/spaces/allenai/WildBench
Apache License 2.0
194
stars
36
forks
source link
add gemini-1.5
#11
Closed
da03
closed
5 months ago