All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More
https://all-hands.dev
MIT License

[Bug]: client runtime waiting too long or unrecoverable container? #3441

Open tobitege opened 1 month ago

tobitege commented 1 month ago

Is there an existing issue for the same bug?

Describe the bug

The retry loop in the client runtime's _wait_until_alive method can stay stuck for far too long when the container is unrecoverable. https://github.com/OpenDevin/OpenDevin/blob/9cb0bf97c1212c801037a39a9222da098d3b53d7/opendevin/runtime/client/runtime.py#L195

I have now seen this several times: the startup process gets stuck at this point without printing any error message. Normally the container should answer within 1 minute in the worst case. Anything longer basically means the container is not recoverable, and startup will still end in an error, but only after several minutes of waiting.

If the wait exceeds 2 minutes, the process should abandon the container and start fresh. That sounds harsh, but since we no longer have persistent containers, the outcome is the same either way (error -> new task).
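To make the proposal concrete, here is a minimal Python sketch of a deadline-bound wait followed by a fresh start. It is not the code in runtime.py. All names here (wait_until_alive, start_fresh_on_timeout, ContainerNotAliveError, and the create/destroy/probe callables) are hypothetical placeholders, and the 120-second default simply mirrors the 2-minute limit suggested above.

```python
import time
from typing import Any, Callable, Optional


class ContainerNotAliveError(RuntimeError):
    """Raised when the container did not respond before the deadline."""


def wait_until_alive(
    check_alive: Callable[[], bool],
    timeout_s: float = 120.0,   # assumption: the 2-minute limit proposed above
    interval_s: float = 2.0,    # assumption: arbitrary polling interval
    clock: Callable[[], float] = time.monotonic,
    sleep: Callable[[float], None] = time.sleep,
) -> float:
    """Poll check_alive until it returns True or timeout_s has elapsed.

    Returns the elapsed seconds on success and raises
    ContainerNotAliveError once the deadline has passed.
    """
    start = clock()
    while True:
        try:
            if check_alive():
                return clock() - start
        except Exception:
            # Assumption: any probe failure just means "not ready yet".
            pass
        elapsed = clock() - start
        if elapsed >= timeout_s:
            raise ContainerNotAliveError(
                f"container not alive after {elapsed:.0f}s "
                f"(limit {timeout_s:.0f}s)"
            )
        # Never sleep past the deadline.
        sleep(min(interval_s, timeout_s - elapsed))


def start_fresh_on_timeout(
    create: Callable[[], Any],
    destroy: Callable[[Any], None],
    probe: Callable[[Any], bool],
    attempts: int = 2,
    **wait_kwargs: Any,
) -> Any:
    """Create a container; if it misses the deadline, discard it and retry."""
    if attempts < 1:
        raise ValueError("attempts must be at least 1")
    last_error: Optional[ContainerNotAliveError] = None
    for _ in range(attempts):
        container = create()
        try:
            wait_until_alive(lambda: probe(container), **wait_kwargs)
            return container
        except ContainerNotAliveError as err:
            last_error = err
            destroy(container)  # abandon the stuck container
    assert last_error is not None
    raise last_error
```

The clock and sleep parameters are injectable only so the deadline logic can be exercised without real waiting. In actual use, probe would wrap whatever liveness request the runtime already sends to the container.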

Current OpenDevin version

0.8.3

Installation and Configuration

Either a build from source or the docker command.

Model and Agent

Operating System

WSL

Reproduction Steps

No specific steps found.

Logs, Errors, Screenshots, and Additional Context

No response

github-actions[bot] commented 3 days ago

This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.