THUDM/AgentBench
A Comprehensive Benchmark to Evaluate LLMs as Agents (ICLR'24)
https://llmbench.ai
Apache License 2.0
fix: fix AgentBench/data/os_interaction/data/4/ N11.json
#149
Open
minleminzui opened 4 months ago
minleminzui commented 4 months ago
The reference answer does not follow the description.