Yifan-Song793 / ETO

Trial and Error: Exploration-Based Trajectory Optimization of LLM Agents (ACL 2024 Main Conference)
https://arxiv.org/abs/2403.02502

Average reward of gpt-3.5-turbo #8

Open George-Chia opened 3 months ago

George-Chia commented 3 months ago

Hello! When I run the following command:

python -m eval_agent.main --agent_config openai --exp_config alfworld --split test --verbose

I get the following results:

All tasks done.
Output saved to outputs/gpt-3.5-turbo/alfworld
Average reward: 0.0299
Success rate: 0.0299

This is much lower than the results reported in Table 2 of the paper. Is there any configuration that needs to be set?
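
For reference, here is a minimal sketch of how numbers like these could arise, assuming each task yields a binary reward (1.0 for success, 0.0 for failure). The helper `summarize_rewards` and the task counts are hypothetical, chosen only for illustration; they are not taken from the ETO code. Under that assumption the average reward and the success rate are the same quantity, which would explain why both are reported as 0.0299.

```python
from typing import Sequence, Tuple


def summarize_rewards(
    rewards: Sequence[float], success_threshold: float = 1.0
) -> Tuple[float, float]:
    """Return (average reward, success rate) for a list of per-task rewards.

    Hypothetical helper for illustration; not part of the ETO repository.
    A task counts as a success when its reward reaches success_threshold.
    """
    if not rewards:
        raise ValueError("rewards must not be empty")
    average_reward = sum(rewards) / len(rewards)
    successes = sum(1 for r in rewards if r >= success_threshold)
    return average_reward, successes / len(rewards)


# Assumed example: 4 successful tasks out of 134 (counts are illustrative).
rewards = [1.0] * 4 + [0.0] * 130
avg_reward, success_rate = summarize_rewards(rewards)
print(f"Average reward: {avg_reward:.4f}")
print(f"Success rate: {success_rate:.4f}")
```

If the rewards really are binary, a value of 0.0299 would mean that only a handful of test tasks succeeded, so the per-task trajectories in the output directory may be worth inspecting for failed or malformed model responses.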