allenai / WildBench

Benchmarking LLMs with Challenging Tasks from Real Users
https://huggingface.co/spaces/allenai/WildBench
Apache License 2.0
177 stars, 25 forks

Add princeton-nlp/gemma-2-9b-it-DPO and princeton-nlp/gemma-2-9b-it-SimPO #17

Status: Closed (xiamengzhou closed 1 month ago)

xiamengzhou commented 1 month ago

Open-weight models:

Misc.

yuchenlin commented 1 month ago

thank you!