THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
https://llmbench.ai
Apache License 2.0
2.24k stars 162 forks source link

OS-task catch errors in container init #164

Closed rjmoss closed 2 weeks ago

rjmoss commented 3 months ago

Added starting the container to the failure modes with an error message if either the init or start scripts fail.

Before this change there were the following problems: