issues
zhangxjohn/LLM-Agent-Benchmark-List
A benchmark list for the evaluation of large language models.
Apache License 2.0
55 stars
1 fork
issues
#2 add two paper about agent
TinnkE, closed 3 months ago, 0 comments
#1 Add '[ICLR 2024] SWE-Bench: Can Language Models Resolve Real-world Github Issues?'
0xdevalias, opened 5 months ago, 3 comments