THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
https://llmbench.ai
Apache License 2.0
2.23k stars 159 forks source link

pull request #171

Open genglongling opened 1 week ago

Xiao9905 commented 1 week ago

@genglongling Hi, thanks for your great contribution! Would you provide a short documentation for the benchmark as AvalonBench does in PR #60 for people's better understanding of the benchmark usage?