mtbench101 / mt-bench-101

[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
Apache License 2.0
35 stars 4 forks source link

Call for code and data! #3

Closed zemerov closed 2 months ago

zemerov commented 4 months ago

The LLM community would higly appreciate if you release the code and data for the mt-bench-101. To the best of my knowledge it is the most comprehensive multi-turn doalogue. So I would like to run several extra models except the ones which were mentioned in the paper.

sefira commented 3 months ago

Thank you all for your interest in MT-Bench-101! We are currently organizing the code and data. In the meantime, you can contact xingyuanbu@gmail.com to obtain the beta version, which can be run on an OpenCompass-based code branch. Thank you!