Closed jatinganhotra closed 1 week ago
Sorry to hear that! How many processes you are using (N_PROCESS
)? Maybe you can try to tune that down to 1 to see if the issue persists?
Here's the pointer i got from claude sonnet 3.5:
This error suggests that the Docker daemon is not responding within the expected time frame (60 seconds). This could be due to several reasons: The Docker daemon is overloaded or not responding properly. There might be network issues if Docker is running on a remote machine. The system running Docker might be under heavy load, causing slow responses. There could be a large number of containers or images, causing the listing operation to take too long.
You could also try check how many containers are running in your system docker ps
-- if there's too many stale containers, you can try to stop them first to free up system resources (e.g., docker ps --format '{{.Names}}' | grep opendevin | xargs docker stop
)
BTW, we are working on a new Runtime for eval (#2404) that completely gets rid of the SSHBox that can sometimes be unstable in the coming weeks - Hopefully this can work better for these evals.
This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.
This issue was closed because it has been stalled for over 30 days with no activity.
Is there an existing issue for the same bug?
Describe the bug
From the paper section
4.4.3 AgentBench
- We selected the code-grounded operating system (OS) subset with 144 tasksI am trying to run the evaluation on OSBench subset of AgentBench using CodeActAgent, but when running
I get the error:
Full trace:
I've followed reinstallation guide and also restarted the server.
Prior to server restart, I was getting this error - https://docs.all-hands.dev/modules/usage/troubleshooting#unable-to-connect-to-ssh-box
but now I am getting the above error. Both errors are related to DockerSSHBox, which is different from the specialized DockerSSHBox for SWEBenchSSHBox.
Note - SWE-Bench evaluation runs fine on this server.
Current OpenDevin version
Reproduction Steps
Logs, Errors, Screenshots, and Additional Context
No response