THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
https://llmbench.ai
Apache License 2.0
2.03k stars 138 forks

init commit avalon #58

Closed: HenryCai11 closed this 9 months ago

HenryCai11 commented 9 months ago

This PR adds the Avalon task. Please refer to the README for a quick start.