THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
https://llmbench.ai
Apache License 2.0
2.15k stars 150 forks source link

什么时候评测一下百度文心模型? #22

Closed vaxilicaihouxian closed 11 months ago

vaxilicaihouxian commented 1 year ago

想看看文心的模型的评测结果

Xiao9905 commented 11 months ago

@vaxilicaihouxian 你好,百度文心的评测结果我们已在内部完成,将在近期更新。欢迎随时关注agentbench的仓库动态~