THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
https://llmbench.ai
Apache License 2.0
2.01k stars 136 forks source link

urgent - if there one of the problems throws an error , why does the overall.json not show up?? #144

Open ishapuri opened 2 weeks ago