MLGroupJLU / LLM-eval-survey

The official GitHub page for the survey paper "A Survey on Evaluation of Large Language Models".
https://arxiv.org/abs/2307.03109

Can you add SpyGame to your survey? #26

Closed Skytliang closed 3 months ago

Skytliang commented 10 months ago

Hi there,

Thanks for the effort in putting together this survey on LLM evaluation.

I'd like to suggest adding our work, SpyGame, a framework for evaluating the intelligence of language models. We propose using word guessing games to assess the language and theory-of-mind intelligence of LLMs.

Paper: Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
GitHub: https://github.com/Skytliang/SpyGame

YuanWu3 commented 9 months ago

Thanks for your valuable contribution. We will include your work in the upcoming version.