Using Agent protocol to run benchmarks. I added downloading and uploading files, this allows to benchmark agents remotely and overall it simplifies the setup.
Other insights are more than welcomed!
Changes
Add option to run benchmarks in API mode, no need to change your agent, if you use agent protocol.
PR Quality Checklist
[x] I have run the following commands against my code to ensure it passes our linters:
Background
Using Agent protocol to run benchmarks. I added downloading and uploading files, this allows to benchmark agents remotely and overall it simplifies the setup.
Other insights are more than welcomed!
Changes
Add option to run benchmarks in API mode, no need to change your agent, if you use agent protocol.
PR Quality Checklist