Closed aorwall closed 1 week ago
@aorwall thanks so much for the original contribution and patience.
While we didn't merge the original code, we accounted for this feature in the new swebench=2.0.0
release.
The docker image caching mechanism takes care of this:
base
, environment
, and instance
. For running on full SWE-bench test, the number of images associated w/ each tier are as follows:
base
: 1 image that is the base image which all instances are built fromenvironment
: 60 images w/ conda environments that all together cover any and all environments used by instancesinstance
: 2294 images (one per instance), which is just env
image + installation of repository at base_commit
of instanceThe report has more advice on how to appropriately set the cache level.
In a nutshell, this feature has been incorporated in swebench>=2.0.0
. Now, with enough storage, instance-specific images can be cached, allowing 2+ evaluation runs of SWE-bench to be completed very quickly.
Reference Issues/PRs
Partly solves #104
What does this implement/fix? Explain your changes.
This change is to make it possible to reuse conda environments when running evaluation.
conda env list
command wasn't properly parsed so the existing environments was detected