THUDM / AgentBench

A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
https://llmbench.ai
Apache License 2.0

About Webshop #91

Closed: dapengchen1234 closed this issue 6 months ago

dapengchen1234 commented 7 months ago

Where is the prompt for the webshop task stored? We cannot reproduce the results reported in the paper.

zhc7 commented 7 months ago

Hi, it is here: https://github.com/THUDM/AgentBench/blob/adc728e073c7ba2934c5fbf05ca1eaa10cc2b21c/src/server/tasks/webshop/__init__.py#L14