Use agent protocol for Benchmarks

Background

This PR uses the Agent Protocol to run benchmarks. I added file download and upload, which makes it possible to benchmark agents remotely and simplifies the overall setup.

Other insights are more than welcome!

Changes

Add an option to run benchmarks in API mode. If your agent already implements the Agent Protocol, no changes to the agent are needed.
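For reviewers unfamiliar with the flow, here is a rough sketch of what a benchmark run over the Agent Protocol could look like from the client side: create a task, step the agent until it reports its last step, then list the artifacts (files) it produced. This is not the code in this PR. The `run_task` and `http_json` names, the `/agent/tasks` path layout, and the `task_id` / `is_last` field names are assumptions based on my reading of the Agent Protocol spec, so check them against the version your agent serves.

```python
import json
import urllib.request
from typing import Any, Callable, Optional

# Transport signature: (method, url, json_payload) -> decoded JSON response.
Transport = Callable[[str, str, Optional[dict]], Any]


def http_json(method: str, url: str, payload: Optional[dict] = None) -> Any:
    """Send a JSON request with the standard library and decode the reply."""
    data = json.dumps(payload).encode() if payload is not None else None
    req = urllib.request.Request(
        url,
        data=data,
        method=method,
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read().decode())


def run_task(
    base_url: str,
    task_input: str,
    send: Transport = http_json,
    max_steps: int = 50,
) -> dict:
    """Create a task, step the agent until its last step, list its artifacts."""
    # Assumed endpoint layout; adjust to whatever the agent actually exposes.
    tasks_url = f"{base_url}/agent/tasks"
    task = send("POST", tasks_url, {"input": task_input})
    task_id = task["task_id"]

    steps = 0
    for _ in range(max_steps):
        step = send("POST", f"{tasks_url}/{task_id}/steps", {})
        steps += 1
        if step.get("is_last"):  # assumed field marking the final step
            break

    # The artifact listing is returned as-is; its exact shape depends on the agent.
    artifacts = send("GET", f"{tasks_url}/{task_id}/artifacts", None)
    return {"task_id": task_id, "steps": steps, "artifacts": artifacts}
```

The transport is passed in as a parameter so the same loop can be pointed at a local agent, a remote one, or a stub in tests. A call such as `run_task("http://localhost:8000", "Write 'hello' to output.txt")` assumes an agent is listening at that (hypothetical) address.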

PR Quality Checklist

[x] I have run the following commands against my code to ensure it passes our linters:

black . --exclude test.py
isort .
mypy .
autoflake --remove-all-unused-imports --recursive --ignore-init-module-imports --ignore-pass-after-docstring --in-place agbenchmark

Significant-Gravitas / Auto-GPT-Benchmarks

Use agent protocol for Benchmarks #271
