distbit0 closed 6 months ago
Thank you very much for your suggestions. We will continue to improve ChatDev in the future.
Thank you for your suggestion! We have other research ongoing that relates to real-life issues and solutions. Right now ChatDev serves as a software-level solution, and we may not test it on SWE-Bench in the short term.
It would be interesting to see the performance on the SWE-Bench benchmark, so that this project can be more clearly differentiated from the growing number of other coding agents.
https://www.swebench.com/
https://github.com/princeton-nlp/SWE-bench
https://arxiv.org/abs/2310.06770