issues
search
princeton-nlp
/
SWE-bench
[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?
https://www.swebench.com
MIT License
1.81k
stars
311
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Install failed on instances from astropy__astropy
#127
JiyangZhang
closed
4 months ago
3
`model_name_or_path` is None when running models without adapters, causing an error in `run_evaluation.py`
#126
rucnyz
closed
3 months ago
2
what's the difference between environment_setup_commit and base_commit?
#125
ramsey-coding
closed
4 months ago
6
how to download one task instance from SWE-bench dataset?
#124
ramsey-coding
closed
4 months ago
1
What's the best way to browse the SWE-bench dataset?
#123
ramsey-coding
closed
4 months ago
2
Add error handling for repo cloning
#122
ALiersEL
closed
4 months ago
1
How can one participate in the SWE-bench leaderboard?
#121
yakami129
closed
4 months ago
3
Using `uv pip` instead of `pip` for significant speedup
#120
klieret
closed
2 months ago
2
Has anyone successfully ran an eval on patches against early versions of astropy, sympy, scipy etc? I'm really struggling to run things from earlier python versions
#119
PandelisZ
closed
3 months ago
4
swe-bench eval stops running after a point
#118
ssh-randy
closed
3 months ago
4
run_live.py: clone_repo() takes 3 positional arguments but 5 were given
#117
stevensu1977
closed
3 months ago
2
Is it possible to evaluate the train set?
#116
chriscremer
closed
3 months ago
1
Why AutoCodeRover not mentioned?
#115
kripper
closed
4 months ago
2
Dockerization of run_evaluation.py
#114
aorwall
closed
3 months ago
5
Reproducer Docker image
#113
zygi
closed
3 months ago
1
Don't clobber the containing environment
#112
waterson
closed
3 months ago
1
environment is lost when running pip install
#111
waterson
closed
3 months ago
1
Ignore not found tests
#110
aorwall
closed
5 months ago
0
Fixes to not have to reinstall testbeds and conda envs
#109
aorwall
closed
3 months ago
1
Do not attempt to mutate dataset object
#108
waterson
closed
5 months ago
0
get_eval_refs doesn't work with a dataset that's been `save_to_disk`'d
#107
waterson
closed
5 months ago
2
Update README.md, fix bug report link
#106
ollmer
closed
5 months ago
1
Share conda environment across evals
#105
waterson
closed
3 months ago
1
improve eval performance by caching per-repo/version conda environments
#104
waterson
closed
3 months ago
1
modify metrics
#103
icoderzqliu
closed
3 months ago
1
swe-bench eval stops running after a point
#102
ssh-randy
closed
4 months ago
1
Update README.md
#101
moresearch
closed
5 months ago
1
Update collection.md
#100
eltociear
closed
5 months ago
1
Dataset field & set up reliable environment
#99
Hodge931
closed
3 months ago
1
Repository not found while running python3 create_text_dataset.py
#98
hsm1997
closed
3 months ago
2
Update environment.yml
#97
moresearch
closed
5 months ago
1
`/bin/sh: pytest: command not found` when running evaluations
#96
psykhi
closed
3 months ago
3
Remove `pre_install` from install specs
#95
carlosejimenez
closed
3 months ago
1
Acr
#94
ojaffe
closed
5 months ago
0
Update evaluation.md evaluation argument hints
#93
ssh-randy
closed
5 months ago
0
Fix dev 2
#92
carlosejimenez
closed
5 months ago
2
Adding LLM tokens to the generated inference for cost calculation
#91
moresearch
closed
3 months ago
1
Fix dev
#90
carlosejimenez
closed
5 months ago
0
conda activate && conda install gxx_linux-64 gcc_linux-64 make -y' returned non-zero exit status 2 and Syntax error: "(" unexpected
#89
SZU-ZJW
closed
5 months ago
8
Running create_text_dataset.py gets Killed and takes too long
#88
vishwa27yvs
opened
5 months ago
3
inference issue run live
#87
kneeraj-AIDE
closed
2 months ago
4
Update log parsing to remove paths from test names
#86
carlosejimenez
closed
5 months ago
0
Use the more portable . instead of source
#85
kevinlu1248
closed
6 months ago
0
Fix pip/python paths in context_manager.py (fixes #55)
#84
thakkarparth007
closed
5 months ago
2
Fix env_list parsing
#83
carlosejimenez
closed
6 months ago
0
Use LogWrapper in ExecWrapper instead of logger
#82
carlosejimenez
closed
6 months ago
0
update: translate readme to Chinese.
#81
iFurySt
closed
6 months ago
1
Issue with Django gold patch results not evaluating to "Resolved" when logs say "All tests passed"
#80
brombaut
closed
3 months ago
8
lv3-0
#79
Iv3-0
closed
6 months ago
0
Fix typo in utils.py
#78
eltociear
closed
6 months ago
0
Previous
Next