issues
search
microsoft
/
SmartPlay
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support future development of LLMs.
Creative Commons Attribution 4.0 International
121
stars
14
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Replicate the scores for Table 2
#29
finalily
opened
5 months ago
3
Update README.md
#28
Holmeswww
closed
7 months ago
0
Bump gradio from 3.39.0 to 4.11.0
#27
dependabot[bot]
opened
11 months ago
0
Bump transformers from 4.31.0 to 4.36.0
#26
dependabot[bot]
opened
11 months ago
0
Update Figure
#25
Holmeswww
closed
11 months ago
0
Unable to load environments via gym.make
#24
ethanluoyc
closed
11 months ago
1
Fix packaging
#23
ethanluoyc
closed
11 months ago
2
Bump aiohttp from 3.8.5 to 3.9.0
#22
dependabot[bot]
opened
11 months ago
0
code
#21
Quester-one
closed
12 months ago
1
Update capability names for better clarity
#20
Holmeswww
closed
1 year ago
0
fix grammar which may cause parsing error in some very small LLMs not tuned to read sentences with grammatical mistakes.
#19
Holmeswww
closed
1 year ago
0
Bump aiohttp from 3.8.5 to 3.8.6
#18
dependabot[bot]
closed
11 months ago
1
Further Bug Fix
#17
Holmeswww
closed
1 year ago
0
Bug fix for MineDojo descriptor
#16
Holmeswww
closed
1 year ago
0
Integrate with LiteLLM - Evaluate 100+LLMs, 92% faster
#15
ishaan-jaff
closed
11 months ago
1
Minor fix
#14
Holmeswww
closed
1 year ago
0
Missing llm_api module
#13
weiwangorg
closed
1 year ago
2
Bump gitpython from 3.1.32 to 3.1.37
#12
dependabot[bot]
opened
1 year ago
0
Add evaluation functionality
#11
Holmeswww
closed
1 year ago
0
Evaluation code / Guidelines for adding new code missing
#10
Holmeswww
closed
1 year ago
1
Fix deprecated issue for latest Numpy
#9
Holmeswww
closed
1 year ago
0
Fix typo in stage_two.py
#8
eltociear
closed
1 year ago
0
Update README
#7
Holmeswww
closed
1 year ago
0
Setuptool fix
#6
Holmeswww
closed
1 year ago
0
OpenAI Gym version seems to cause error.
#5
Holmeswww
closed
1 year ago
2
Bump pillow from 9.4.0 to 10.0.1
#4
dependabot[bot]
opened
1 year ago
0
Update README.md
#3
Holmeswww
closed
1 year ago
0
Bump gitpython from 3.1.32 to 3.1.35
#2
dependabot[bot]
closed
1 year ago
1
Initial Commit
#1
Holmeswww
closed
1 year ago
0