Missing `validation.ipynb`?

Thanks for pointing this out @xingyaoww! We focused on improving evaluation with the new harness, and as a result the validation capabilities from our original repository have not yet been ported to it. Since nearly everyone using SWE-bench has focused on evaluation, we did not make updating validation a top priority.

It shouldn't be too difficult to re-incorporate. We're actively working on an updated validation pipeline and new SWE-bench eval data, and we'll release both together in the near future (by the end of summer, around August, at the latest).

However, if you'd like to run validation now, the best option is to check out an older version of SWE-bench and use the validation code there. Given all the fixes to SWE-bench since then, though, I think that form of collection can be prone to reproducibility issues, as we've seen 😅

Closing this for now, but please feel free to re-open with follow-ups (we can always discuss more in PM too!).

princeton-nlp / SWE-bench

Missing `validation.ipynb`? #163

Describe the issue

Suggest an improvement to documentation