Closed Hodge931 closed 1 week ago
By this line: https://github.com/princeton-nlp/SWE-bench/blob/main/swebench/metrics/getters.py#L122
Does it mean if in the evaluation of an instance, one skipped test case is not in FAIL_TO_PASS or PASS_TO_PASS category, then the instance is considered as not resolved?
No response
@Hodge931 that is correct. If the test doesn't show up, we assume the outcome to be a fail.
Describe the issue
By this line: https://github.com/princeton-nlp/SWE-bench/blob/main/swebench/metrics/getters.py#L122
Does it mean if in the evaluation of an instance, one skipped test case is not in FAIL_TO_PASS or PASS_TO_PASS category, then the instance is considered as not resolved?
Suggest an improvement to documentation
No response