All-Hands-AI / OpenHands

🙌 OpenHands: Code Less, Make More
https://all-hands.dev
MIT License
36.59k stars 4.15k forks source link

(eval) incorporate SimpleBench #5074

Closed tobitege closed 2 hours ago

tobitege commented 2 hours ago

What problem or use case are you trying to solve?

Add SimpleBench to eval suite.

tobitege commented 2 hours ago

Ah, my bad, this isn't for agentic benches, nvm.