logic-star-ai / swt-bench

[NeurIPS 2024] Evaluation harness for SWT-Bench, a benchmark for evaluating LLM repository-level test-generation
https://openreview.net/forum?id=9Y8zUO11EQ&noteId=9Y8zUO11EQ
MIT License
16 stars 2 forks source link

Reproduce docker image fixes (pinning versions) from SWE-Bench #12

Closed nielstron closed 1 week ago

nielstron commented 1 week ago

This pre-emptively reproduces some of the PRs reported at SWE-bench for lacking pinned versions. List, to be updated:

TODO: