princeton-nlp / SWE-bench

[ICLR 2024] SWE-bench: Can Language Models Resolve Real-world Github Issues?
https://www.swebench.com
MIT License
2k stars 348 forks source link

Evaluation hangs on "Building environment images" #245

Closed thetonywu closed 1 week ago

thetonywu commented 1 week ago

Describe the issue

When trying to run swebench.harness.run_evaluation, it is able to build all images except one, which it hangs on indefinitely. I tried killing the process and trying again, but it keeps getting stuck. Not sure how to debug from this point, so any help would be appreciated on this.

Suggest an improvement to documentation

No response