Open AlexCuadron opened 1 day ago
CC @xingyaoww , where do those docker images get built?
I manually build these SWE-Bench images and push them to Dockerhub since the original authors didn't release them. I think we might need the fix in #4402 to specify build arch for SWE-Bench evaluation.
Is there an existing issue for the same bug?
Describe the bug and reproduction steps
When trying to execute run_infer.sh from the swe_bench evaluation harness using this command:
./evaluation/swe_bench/scripts/run_infer.sh llm.mini HEAD CodeActAgent 100 30 1 "princeton-nlp/SWE-bench_Lite" test
The sandbox image build fails as it tries to fetch a specific image from ghcr.io:ERROR: ghcr.io/all-hands-ai/runtime:oh_v0.9.8_image_7c37a7ac_.astropy_s_astropy-12907_tag_latest: not found
The error occurs because the base image specified in the Dockerfile (ghcr.io/all-hands-ai/runtime:oh_v0.9.8_image7c37a7ac.astropy_s_astropy-12907_tag_latest) cannot be found in the container registry, causing the build to fail.However, when I use the --platform=linux/amd64 flag, the build works successfully:
docker buildx build --platform=linux/amd64 ...
This indicates that the image is only available for the amd64 architecture, and not for arm64 (the default architecture on Apple Silicon or other systems).
OpenHands Installation
Development workflow
OpenHands Version
0.9.8
Operating System
MacOS
Logs, Errors, Screenshots, and Additional Context