THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
https://llmbench.ai
Apache License 2.0
2.15k stars 150 forks source link

[Bug/Assistance] document on v0.2 #55

Closed nlpcat closed 11 months ago

nlpcat commented 11 months ago

It seems like we can't run the step by step installation and tests anymore since v0.2. For example, the requirements.txt doesn't cover the package needed by the code fschat and antropic. Any new document reference for this new version? And how can we build the docker image? thanks

nlpcat commented 11 months ago

@zhc7 @Longin-Yu

Longin-Yu commented 11 months ago

Really thanks for pointing this out! We have fixed this problem.

nlpcat commented 11 months ago

@Longin-Yu it seems like I also can't find the documentation that would include the docker image build python src/server/tasks/os_interaction/images.py build -c configs/tasks/os.yaml -r . for v0.2. in addition, images.py is broken that needs a small fix to point out to useos-std or os-dev

Longin-Yu commented 11 months ago

In v0.2, the images.py file has been deprecated, and we recommend building the images directly. Another advantage of this approach is that it allows for easier bug localization in cases where issues arise due to network or environmental factors.