aymeric-roucher / agent_reasoning_benchmark

🔧 Compare how Agent systems perform on several benchmarks. 📊🚀
Apache License 2.0
41 stars 5 forks source link