felipemaiapolo / tinyBenchmarks

Evaluating LLMs with fewer examples
MIT License
106 stars, 11 forks