mtbench101 / mt-bench-101

[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
Apache License 2.0

Introducing the MT-Bench-101 Beta Version! #4

Closed by sefira 2 months ago

sefira commented 3 months ago

Thank you all for your interest in MT-Bench-101! You can contact xingyuanbu@gmail.com to obtain the beta version, which can be run on an OpenCompass-based code branch.

In the meantime, we are organizing the code and data for the next version.

Thank you!