open-compass / T-Eval

[ACL2024] T-Eval: Evaluating Tool Utilization Capability of Large Language Models Step by Step
https://open-compass.github.io/T-Eval/
Apache License 2.0
235 stars, 15 forks

Does the benchmark include anything that tests the translation ability of large language models? If so, which item? #16

White-Friday closed this issue 9 months ago

zehuichen123 commented 10 months ago

Sorry... this benchmark is mainly designed to evaluate a model's tool-calling ability; for now it does not assess translation ability.