issues
search
hkust-nlp
/
AgentBoard
An Analytical Evaluation Board of Multi-turn LLM Agents
250
stars
26
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Can AgentBoard be tested in an offline environment?
#25
Jungle728
opened
1 week ago
0
Moviedb needs commercial license
#24
kesimeg
closed
1 month ago
1
(BUG FIX) ModuleNotFoundError: No module named 'playwright._impl._api_types'
#23
zhanwenchen
opened
2 months ago
1
(BUG FIX) Catching `SystemExit`s Independently from Generic `Exception`s
#22
zhanwenchen
closed
2 months ago
1
KeyError: No "additional_info" Field.
#21
zhanwenchen
closed
2 months ago
1
(Bug Fix) Fixes ModuleNotFoundError: No module named 'playwright._imp…
#20
zhanwenchen
closed
2 months ago
2
Fix template of `carry` predicate
#19
harshakokel
opened
2 months ago
0
Is here any plan for release virutualhome dataset
#18
zxia545
opened
3 months ago
0
Any plans to add new models?
#17
ryoungj
opened
3 months ago
1
mismatch between prompt and "check valid action" in babyai
#16
Fu-Dayuan
opened
4 months ago
0
Interaction trajectories available ?
#15
yananchen1989
opened
5 months ago
2
wrong data in tool-query
#14
JimSalesforce
closed
5 months ago
2
Add human and gpt-4o engines
#13
OliBomby
opened
6 months ago
0
Fix missing goal for huntdark jericho game
#12
OliBomby
opened
6 months ago
0
Fix jericho grounding accuracy
#11
OliBomby
opened
6 months ago
0
pddl和jericho的check valid action可能有bug
#10
Fu-Dayuan
opened
6 months ago
2
webshop好像也有一些bug
#9
Fu-Dayuan
closed
7 months ago
2
GPT-4 model missing
#8
yongchao98
closed
5 months ago
1
merge main to develop
#7
yc1999
closed
8 months ago
0
SR好像有一些bug
#6
Fu-Dayuan
opened
8 months ago
2
reactagent版本是还没有上传吗?
#5
Fu-Dayuan
opened
8 months ago
4
[Request] Google Gemini Pro
#4
abdinal1
closed
5 months ago
2
[Refactor Request] Structured objects/typing
#3
logan-markewich
closed
5 months ago
2
Update README.md
#2
eltociear
closed
10 months ago
0
partial import support
#1
chang-github-00
closed
10 months ago
0