microsoft / SmartPlay

SmartPlay is a benchmark for Large Language Models (LLMs). It uses a variety of games to test important capabilities of LLMs as agents. SmartPlay is designed to be easy to use and to support future development of LLMs.
Creative Commons Attribution 4.0 International

Evaluation code / Guidelines for adding new code missing #10

Closed. Holmeswww closed this issue 1 year ago

Holmeswww commented 1 year ago

We will add the corresponding code in a few days.

Holmeswww commented 1 year ago

resolved #11