Closed jatinganhotra closed 1 week ago
Yes - i think that's somewhat expected behavior - docker acts weirdly when you try to run multiple images at once.
You can consider join our eval channel #remote-runtime-limited-beta to get access to our new infra for eval in parallel: https://www.all-hands.dev/blog/evaluation-of-llms-as-coding-agents-on-swe-bench-at-30x-speed
@xingyaoww just to clarify, when you say this is expected behavior, do you mean this will likely not be fixed? In the README: https://github.com/All-Hands-AI/OpenHands/tree/main/evaluation/swe_bench It specifically allows you to set number of workers
Yeah i think so - maybe we should make this clearer on the README there
This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.
Is there an existing issue for the same bug?
Describe the bug
Hi team,
When I am trying to run inference for SWE-Bench Lite with > 1 worker, I am getting the following error. The inference runs OK with only 1 worker, which is the default value.
./evaluation/swe_bench/scripts/run_infer.sh MODEL_CONFIG
with the default CodeActAgentI'm getting the following error
Stack trace:
STDOUT logs at the beginning
Current OpenHands version
Installation and Configuration