Significant-Gravitas/Auto-GPT-Benchmarks
A repo built for the purpose of benchmarking the performance of agents, regardless of how they are set up and how they work.
MIT License
275 stars
76 forks
issues
adding backend and a basic ui
#309
SilenNaihin
closed
1 year ago
1
Remove skill tree sync
#308
waynehamadi
closed
1 year ago
1
init backend, fix frontend module
#307
SilenNaihin
closed
1 year ago
0
new frontend connections
#306
SilenNaihin
closed
1 year ago
1
fix eval
#305
waynehamadi
closed
1 year ago
1
Fix eval
#304
waynehamadi
closed
1 year ago
1
chore: polygpt update to include gpt4
#303
rihp
closed
1 year ago
1
Fix linter
#302
waynehamadi
closed
1 year ago
1
Fix agent protocol test
#301
waynehamadi
closed
1 year ago
1
Add safety challenge
#300
waynehamadi
closed
1 year ago
1
0.0.8
#299
waynehamadi
closed
1 year ago
1
Update .env.example
#298
westonwillingham
closed
1 year ago
1
Increase timeout
#297
waynehamadi
closed
1 year ago
1
Fix all tests skipped
#296
waynehamadi
closed
1 year ago
1
Release 0.0.7
#295
waynehamadi
closed
1 year ago
1
Fix all tests skipped
#294
waynehamadi
closed
1 year ago
1
Use index.html instead of dependencies.html
#293
waynehamadi
closed
1 year ago
1
No need to push skill tree twice
#292
waynehamadi
closed
1 year ago
1
Remember goal loss
#291
waynehamadi
closed
1 year ago
1
If regression tests empty continue
#290
waynehamadi
closed
1 year ago
1
Sync skill tree to a versioned website
#289
waynehamadi
closed
1 year ago
1
Update beebot
#288
erik-megarad
closed
1 year ago
0
Cleanup skill tree
#287
waynehamadi
closed
1 year ago
0
feat: add ethereum price challenge
#286
rihp
closed
1 year ago
1
Add more fields to gdrive
#285
waynehamadi
closed
1 year ago
1
Implement the 'explore' mode
#284
waynehamadi
closed
1 year ago
1
Removed accidentally added reports
#283
nerfZael
closed
1 year ago
0
Remove baserun because api key issue
#282
waynehamadi
closed
1 year ago
0
Update beebot
#281
erik-megarad
closed
1 year ago
0
Release 0.0.4
#280
waynehamadi
closed
1 year ago
0
See the task when clicking in the skill tree
#279
waynehamadi
closed
1 year ago
0
Use agent protocol
#278
jakubno
closed
1 year ago
0
Fix send to gdrive
#277
waynehamadi
closed
1 year ago
1
Put back mini agi to original state
#276
waynehamadi
closed
1 year ago
1
Integrate baserun
#275
waynehamadi
closed
1 year ago
1
Integrate with baserun
#274
waynehamadi
closed
1 year ago
1
PolyGPT Benchmarks and Submodule Update
#273
rihp
closed
1 year ago
2
Add web app creation challenge
#272
waynehamadi
closed
1 year ago
1
Use agent protocol for Benchmarks
#271
jakubno
closed
1 year ago
0
AUTO-25: Add the ability to run multiple categories and to skip categories
#270
Swiftyos
closed
1 year ago
1
Adding PolyGPT to the submodules and CI
#269
rihp
closed
1 year ago
0
Update pr template
#268
waynehamadi
closed
1 year ago
1
Add product advisor tests
#267
waynehamadi
closed
1 year ago
0
Fix test write file
#266
waynehamadi
closed
1 year ago
1
Kill all subprocesses
#265
erik-megarad
closed
1 year ago
0
Remove graphql logs
#264
waynehamadi
closed
1 year ago
1
Helicone Lock Manager fix
#263
waynehamadi
closed
1 year ago
1
Remove space challenges
#262
waynehamadi
closed
1 year ago
0
Feat: --cutoff and "keep_workspace_files" options
#261
lc0rp
closed
1 year ago
0
Add all agent protocol tests
#260
waynehamadi
closed
1 year ago
0