conda activate && conda install gxx_linux-64 gcc_linux-64 make -y' returned non-zero exit status 2 and Syntax error: "(" unexpected

SZU-ZJW commented 2 months ago

Describe the bug

When I use run_evaluation.py to evaluate the results, I get an error. The dataset is SWE-bench-BM25-13K and the model is SWE-Llama-7B.

Traceback (most recent call last):
  File "/home/xxx/SWE-Bench/main/harness/run_evaluation.py", line 229, in <module>
    main(**vars(args))
  File "/home/xxx/SWE-Bench/main/harness/run_evaluation.py", line 203, in main
    pool.map(eval_engine, eval_args)
  File "/home/xxx/anaconda3/envs/swe-bench/lib/python3.9/multiprocessing/pool.py", line 364, in map
    return self._map_async(func, iterable, mapstar, chunksize).get()
  File "/home/xxx/anaconda3/envs/swe-bench/lib/python3.9/multiprocessing/pool.py", line 771, in get
    raise self._value
subprocess.CalledProcessError: Command '. /home/zjw/SWE-Bench/main/evaluation/testbed/SWE-Llama-7B/sphinx-doc__sphinx/7.1/tmp0si9huh_/miniconda3/bin/activate sphinx-doc__sphinx__7.1 && conda install gxx_linux-64 gcc_linux-64 make -y' returned non-zero exit status 2.
Another problem is:
2024-04-09 21:15:45,153 - ERROR - Error stderr: /bin/sh: 5: /home/xxx/SWE-Bench/main/evaluation/testbed/SWE-Llama-7B/sphinx-doc__sphinx/3.2/tmpj8e2wxor/miniconda3/etc/conda/deactivate.d/deactivate-gxx_linux-64.sh: Syntax error: "(" unexpected

I look forward to your reply. Any reply would be a huge help to me, and I would be very grateful.

Steps/Code to Reproduce

code in harness/run_evaluation.py.

Expected Results

This error should not occur and the command can be executed correctly. Miniconda shouldn't have a problem either.

Actual Results

An unexpected error occurred

System Information

Linux, Python 3.9

brombaut commented 2 months ago

Is your error similar to the one in #77 or any of the issues mentioned on that thread? (Full disclaimer: I'm not a maintainer of swebench, just trying to get it to work successfully.)

SZU-ZJW commented 2 months ago

Thank you for your reply. Regarding the second problem, the suggestion I found on the Internet is to reinstall miniconda, but this obviously doesn't work because the miniconda installation here is temporary. There is still no good solution to the first problem.
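One plausible reading of the `Syntax error: "(" unexpected` message, offered as an assumption rather than a confirmed diagnosis: the `/bin/sh: 5:` prefix looks like output from a POSIX shell such as dash, which rejects bash-only syntax that conda's activate/deactivate scripts may use. Python's `subprocess.run(..., shell=True)` uses `/bin/sh` by default, and the `executable` parameter selects a different shell. The helper name `run_in_bash` below is hypothetical and is not part of the SWE-bench harness.

```python
import shutil
import subprocess


def run_in_bash(cmd: str) -> subprocess.CompletedProcess:
    """Run cmd through bash instead of the default /bin/sh (hypothetical helper)."""
    bash = shutil.which("bash")
    if bash is None:
        raise RuntimeError("bash was not found on PATH")
    # executable= replaces /bin/sh as the shell that interprets cmd.
    return subprocess.run(
        cmd,
        shell=True,
        executable=bash,
        check=True,
        capture_output=True,
        text=True,
    )
```

Whether this helps depends on which shell `/bin/sh` points to on the machine; checking `ls -l /bin/sh` would confirm or rule out this guess.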

brombaut commented 2 months ago

@SZU-ZJW What's the actual error for the first problem? You've just included the stack trace

SZU-ZJW commented 2 months ago

subprocess.CalledProcessError: Command '. /home/zjw/SWE-Bench/main/evaluation/testbed/SWE-Llama-7B/sphinx-doc__sphinx/7.1/tmp0si9huh/miniconda3/bin/activate sphinx-doc__sphinx__7.1 && conda install gxx_linux-64 gcc_linux-64 make -y' returned non-zero exit status 2. This is the reported error. I don't know how to fix it; it seems to come from line 389 of SWE-bench/swebench/harness/context_manager.py. I am a little at a loss with this problem.

brombaut commented 2 months ago

@SZU-ZJW Yeah, that's the command that failed: the Python code tried to create a subprocess with the command . /home/zjw/SWE-Bench/main/evaluation/testbed/SWE-Llama-7B/sphinx-doc__sphinx/7.1/tmp0si9huh/miniconda3/bin/activate sphinx-doc__sphinx__7.1 && conda install gxx_linux-64 gcc_linux-64 make -y, and that command returned non-zero exit status 2. But there should be an actual error in the output saying why the command failed. Can you include the full log of that run?
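To illustrate the distinction above between the failed command and the underlying error: `CalledProcessError` only reports the command and its exit status, while the reason usually sits in the process's stderr. The sketch below shows one way to surface it with the standard library; `run_and_report` is a hypothetical name and is not how the harness itself logs failures.

```python
import subprocess


def run_and_report(cmd: str) -> subprocess.CompletedProcess:
    """Run a shell command; on failure, print its stderr before re-raising."""
    try:
        return subprocess.run(
            cmd, shell=True, check=True, capture_output=True, text=True
        )
    except subprocess.CalledProcessError as err:
        # err.returncode is the "non-zero exit status" from the traceback;
        # err.stderr holds whatever the command itself wrote to stderr.
        print(f"exit status: {err.returncode}")
        print(f"stderr:\n{err.stderr}")
        raise
```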

SZU-ZJW commented 2 months ago

Of course, here is the log file for this problem: testbed_sphinx_7.1.log

john-b-yang commented 2 months ago

Hi @brombaut @SZU-ZJW, we just released a report on the fixes we've been working on to get SWE-bench evaluation to work reliably. You can read about it here.

Based on what you've detailed in this issue, I think this is likely related to failure mode 2. It seems to me that you're running on an ARM machine, given that this command is failing:

conda install gxx_linux-64 gcc_linux-64 make -y

You could potentially try commenting out this line to see if the evaluation can still work without having to install the arch_specific_packages we specified.
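As an alternative to commenting the line out by hand, the install step could in principle be skipped automatically on machines it does not apply to. The sketch below is only an illustration under stated assumptions: the dictionary `ARCH_SPECIFIC_PACKAGES` and the function `arch_install_command` are hypothetical and do not reflect the harness's actual code; only `platform.machine()` is a real standard-library call.

```python
import platform

# Hypothetical mapping; the package list is copied from the failing command.
ARCH_SPECIFIC_PACKAGES = {
    "x86_64": ["gxx_linux-64", "gcc_linux-64", "make"],
}


def arch_install_command(machine=None):
    """Return a conda install command for this architecture, or None to skip."""
    machine = machine or platform.machine()
    packages = ARCH_SPECIFIC_PACKAGES.get(machine)
    if not packages:
        return None  # e.g. aarch64/arm64: no arch-specific packages to install
    return f"conda install {' '.join(packages)} -y"
```

A caller would run the returned command only when it is not `None`, which has the same effect as commenting out the line on unsupported machines.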

SZU-ZJW commented 2 months ago

Thank you for your answer, it really solved my problem!
