Jellyfish042 / uncheatable_eval

Evaluating LLMs with Dynamic Data
MIT License
70 stars 4 forks source link

2024-08 data #7

Closed Jellyfish042 closed 2 months ago