nus-apr / auto-code-rover

A project structure aware autonomous software engineer aiming for autonomous program improvement. Resolved 30.67% tasks (pass@1) in SWE-bench lite and 38.40% tasks (pass@1) in SWE-bench verified with each task costs less than $0.7.
Other
2.67k stars 276 forks source link

Check that task ids have valid setups before running them. #23

Closed fovi-com closed 5 months ago

fovi-com commented 5 months ago

The environment setup process is complex and problems like unsupported dependency versions for some project versions in some cases can be difficult to resolve. Therefore we should be tolerant when tasks can't be run. So this change checks that there is a task setup and mapping for each task id before trying to run them.