Closed ssh-randy closed 5 months ago
QOL update to evaluation readme, so users know they can directly use the string "test" or "dev" for the swe_bench_task arg, so that users don't need to upload any files.
What does this implement/fix? Explain your changes.
QOL update to evaluation readme, so users know they can directly use the string "test" or "dev" for the swe_bench_task arg, so that users don't need to upload any files.