princeton-nlp SWE-bench issues

princeton-nlp / SWE-bench

[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?

https://www.swebench.com

MIT License

1.81k stars 311 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Install failed on instances from astropy__astropy

#127 JiyangZhang closed 4 months ago
3
`model_name_or_path` is None when running models without adapters, causing an error in `run_evaluation.py`

#126 rucnyz closed 3 months ago
2
what's the difference between environment_setup_commit and base_commit?

#125 ramsey-coding closed 4 months ago
6
how to download one task instance from SWE-bench dataset?

#124 ramsey-coding closed 4 months ago
1
What's the best way to browse the SWE-bench dataset?

#123 ramsey-coding closed 4 months ago
2
Add error handling for repo cloning

#122 ALiersEL closed 4 months ago
1
How can one participate in the SWE-bench leaderboard?

#121 yakami129 closed 4 months ago
3
Using `uv pip` instead of `pip` for significant speedup

#120 klieret closed 2 months ago
2
Has anyone successfully ran an eval on patches against early versions of astropy, sympy, scipy etc? I'm really struggling to run things from earlier python versions

#119 PandelisZ closed 3 months ago
4
swe-bench eval stops running after a point

#118 ssh-randy closed 3 months ago
4
run_live.py: clone_repo() takes 3 positional arguments but 5 were given

#117 stevensu1977 closed 3 months ago
2
Is it possible to evaluate the train set?

#116 chriscremer closed 3 months ago
1
Why AutoCodeRover not mentioned?

#115 kripper closed 4 months ago
2
Dockerization of run_evaluation.py

#114 aorwall closed 3 months ago
5
Reproducer Docker image

#113 zygi closed 3 months ago
1
Don't clobber the containing environment

#112 waterson closed 3 months ago
1
environment is lost when running pip install

#111 waterson closed 3 months ago
1
Ignore not found tests

#110 aorwall closed 5 months ago
0
Fixes to not have to reinstall testbeds and conda envs

#109 aorwall closed 3 months ago
1
Do not attempt to mutate dataset object

#108 waterson closed 5 months ago
0
get_eval_refs doesn't work with a dataset that's been `save_to_disk`'d

#107 waterson closed 5 months ago
2
Update README.md, fix bug report link

#106 ollmer closed 5 months ago
1
Share conda environment across evals

#105 waterson closed 3 months ago
1
improve eval performance by caching per-repo/version conda environments

#104 waterson closed 3 months ago
1
modify metrics

#103 icoderzqliu closed 3 months ago
1
swe-bench eval stops running after a point

#102 ssh-randy closed 4 months ago
1
Update README.md

#101 moresearch closed 5 months ago
1
Update collection.md

#100 eltociear closed 5 months ago
1
Dataset field & set up reliable environment

#99 Hodge931 closed 3 months ago
1
Repository not found while running python3 create_text_dataset.py

#98 hsm1997 closed 3 months ago
2
Update environment.yml

#97 moresearch closed 5 months ago
1
`/bin/sh: pytest: command not found` when running evaluations

#96 psykhi closed 3 months ago
3
Remove `pre_install` from install specs

#95 carlosejimenez closed 3 months ago
1
Acr

#94 ojaffe closed 5 months ago
0
Update evaluation.md evaluation argument hints

#93 ssh-randy closed 5 months ago
0
Fix dev 2

#92 carlosejimenez closed 5 months ago
2
Adding LLM tokens to the generated inference for cost calculation

#91 moresearch closed 3 months ago
1
Fix dev

#90 carlosejimenez closed 5 months ago
0
conda activate && conda install gxx_linux-64 gcc_linux-64 make -y' returned non-zero exit status 2 and Syntax error: "(" unexpected

#89 SZU-ZJW closed 5 months ago
8
Running create_text_dataset.py gets Killed and takes too long

#88 vishwa27yvs opened 5 months ago
3
inference issue run live

#87 kneeraj-AIDE closed 2 months ago
4
Update log parsing to remove paths from test names

#86 carlosejimenez closed 5 months ago
0
Use the more portable . instead of source

#85 kevinlu1248 closed 6 months ago
0
Fix pip/python paths in context_manager.py (fixes #55)

#84 thakkarparth007 closed 5 months ago
2
Fix env_list parsing

#83 carlosejimenez closed 6 months ago
0
Use LogWrapper in ExecWrapper instead of logger

#82 carlosejimenez closed 6 months ago
0
update: translate readme to Chinese.

#81 iFurySt closed 6 months ago
1
Issue with Django gold patch results not evaluating to "Resolved" when logs say "All tests passed"

#80 brombaut closed 3 months ago
8
lv3-0

#79 Iv3-0 closed 6 months ago
0
Fix typo in utils.py

#78 eltociear closed 6 months ago
0

Previous Next