princeton-nlp / SWE-bench

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?
https://www.swebench.com
MIT License

improve eval performance by caching per-repo/version conda environments #104

Closed: waterson closed this issue 1 week ago

waterson commented 2 months ago

Describe the feature

Right now, running an eval (e.g., with the SWE-agent evaluation/evaluation.py script) creates a temporary conda environment from scratch on every run. Since the environment depends only on the repo and version, it could instead be created once per repo/version pair and then reused across evaluations.

Potential Solutions

One way to do this (for which I'll attach a PR) is to simply configure a reasonable path_conda in the eval args; e.g.,

args.path_conda = os.path.join(testbed, "conda", repo.rsplit('__', 1)[-1], version)
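For reference, a minimal sketch of how that path could be used to build the environment once and reuse it on later runs (the cached_conda_env helper, the conda create invocation, and the pinned Python version are illustrative assumptions, not the actual evaluation code):

import os
import subprocess

def cached_conda_env(testbed: str, repo: str, version: str) -> str:
    """Return a per-repo/version conda env prefix, creating it only if missing."""
    env_path = os.path.join(testbed, "conda", repo.rsplit("__", 1)[-1], version)
    if not os.path.isdir(env_path):
        # First eval for this repo/version: build the environment once.
        subprocess.run(
            ["conda", "create", "-y", "--prefix", env_path, "python=3.9"],
            check=True,
        )
    # Later evals reuse the same prefix instead of rebuilding a temp env.
    return env_path

# e.g. in the eval setup:
# args.path_conda = cached_conda_env(testbed, repo, version)
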
thisdotmatt commented 2 weeks ago

I believe Auto Code Rover has an implementation of this in which they group non-redundant conda environments and cache them.
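A rough sketch of that grouping idea, assuming it means keying cached environments by their dependency spec so identical specs collapse to a single environment (the helper names and spec shape below are hypothetical, not Auto Code Rover's actual code):

import hashlib
import json
import os

def env_cache_key(install_spec: dict) -> str:
    # Hash the canonicalized spec (python version, pip/conda packages) so that
    # repo/version pairs with identical requirements map to the same key.
    canonical = json.dumps(install_spec, sort_keys=True)
    return hashlib.sha256(canonical.encode()).hexdigest()[:12]

def shared_env_path(testbed: str, install_spec: dict) -> str:
    # Redundant specs collapse to a single directory under <testbed>/conda/
    # that every matching eval can reuse.
    return os.path.join(testbed, "conda", env_cache_key(install_spec))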