blarApp / code-base-agent

Code agents for LLMs
https://blar.io
MIT License
43 stars 11 forks source link

Evaluate performance against SWE-Bench #70

Open 0xdevalias opened 5 months ago

0xdevalias commented 5 months ago

It would be interesting to see if/how blar performs against the SWE-Bench benchmarks:

berrazuriz1 commented 5 months ago

We're on it and can keep you updated. We've created a Discord server where we will post our progress.

0xdevalias commented 5 months ago

@berrazuriz1 Sounds good; though I generally find Discord a super noisy/inefficient way to try and follow updates (particularly given how every project/etc seems to have one these days).

Hopefully you can post any 'major progress' milestones to this issue/similar as well?