THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
https://llmbench.ai
Apache License 2.0
2.01k stars 136 forks source link

Please check my problem description and corresponding check code #137

Closed StupiddCupid closed 1 month ago

StupiddCupid commented 1 month ago

Test Cases(Prob Description&Check Code).pdf