issues
search
mtbench101
/
mt-bench-101
[ACL 2024] MT-Bench-101: A Fine-Grained Benchmark for Evaluating Large Language Models in Multi-Turn Dialogues
Apache License 2.0
46
stars
15
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Data generation
#10
abudukeyumu
closed
1 month ago
0
Question about the evaluation code.
#9
HYeCao
opened
2 months ago
4
[Bug] When evaluation, {prediction} in origin_prompt is not replaced with model's response?
#8
liuyaox
closed
4 months ago
3
OpenCompass 实现提示词格式对人不友好
#7
Leymore
closed
4 months ago
2
Mtbench101 2oc
#6
sefira
closed
4 months ago
0
Mtbench101
#5
sefira
closed
4 months ago
0
Introducing the MT-Bench-101 Beta Version!
#4
sefira
closed
5 months ago
0
Call for code and data!
#3
zemerov
closed
5 months ago
1
论文都发表了,写在论文里,github仓库为空
#2
victorjiax
closed
5 months ago
1
所以到底什么时候放出来?
#1
Trangle
closed
2 weeks ago
1