issues
search
microsoft
/
SmartPlay
SmartPlay is a benchmark for Large Language Models (LLMs). Uses a variety of games to test various important LLM capabilities as agents. SmartPlay is designed to be easy to use, and to support future development of LLMs.
Creative Commons Attribution 4.0 International
116
stars
14
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Replicate the scores for Table 2
#29
finalily
opened
3 months ago
3
Update README.md
#28
Holmeswww
closed
5 months ago
0
Bump gradio from 3.39.0 to 4.11.0
#27
dependabot[bot]
opened
9 months ago
0
Bump transformers from 4.31.0 to 4.36.0
#26
dependabot[bot]
opened
9 months ago
0
Update Figure
#25
Holmeswww
closed
10 months ago
0
Unable to load environments via gym.make
#24
ethanluoyc
closed
10 months ago
1
Fix packaging
#23
ethanluoyc
closed
10 months ago
2
Bump aiohttp from 3.8.5 to 3.9.0
#22
dependabot[bot]
opened
10 months ago
0
code
#21
Quester-one
closed
10 months ago
1
Update capability names for better clarity
#20
Holmeswww
closed
10 months ago
0
fix grammar which may cause parsing error in some very small LLMs not tuned to read sentences with grammatical mistakes.
#19
Holmeswww
closed
10 months ago
0
Bump aiohttp from 3.8.5 to 3.8.6
#18
dependabot[bot]
closed
10 months ago
1
Further Bug Fix
#17
Holmeswww
closed
10 months ago
0
Bug fix for MineDojo descriptor
#16
Holmeswww
closed
10 months ago
0
Integrate with LiteLLM - Evaluate 100+LLMs, 92% faster
#15
ishaan-jaff
closed
9 months ago
1
Minor fix
#14
Holmeswww
closed
11 months ago
0
Missing llm_api module
#13
weiwangorg
closed
11 months ago
2
Bump gitpython from 3.1.32 to 3.1.37
#12
dependabot[bot]
opened
12 months ago
0
Add evaluation functionality
#11
Holmeswww
closed
12 months ago
0
Evaluation code / Guidelines for adding new code missing
#10
Holmeswww
closed
12 months ago
1
Fix deprecated issue for latest Numpy
#9
Holmeswww
closed
12 months ago
0
Fix typo in stage_two.py
#8
eltociear
closed
12 months ago
0
Update README
#7
Holmeswww
closed
1 year ago
0
Setuptool fix
#6
Holmeswww
closed
1 year ago
0
OpenAI Gym version seems to cause error.
#5
Holmeswww
closed
1 year ago
2
Bump pillow from 9.4.0 to 10.0.1
#4
dependabot[bot]
opened
1 year ago
0
Update README.md
#3
Holmeswww
closed
1 year ago
0
Bump gitpython from 3.1.32 to 3.1.35
#2
dependabot[bot]
closed
12 months ago
1
Initial Commit
#1
Holmeswww
closed
1 year ago
0