FreedomIntelligence / LLMZoo

LLM Zoo is a project that provides data, models, and evaluation benchmarks for large language models.
Apache License 2.0

How to eval and rank model outputs? #30

Open LuciusMos opened 1 year ago

LuciusMos commented 1 year ago

In https://github.com/FreedomIntelligence/LLMZoo/tree/main/llmzoo/eval, the README is missing its second section, "How to evaluate?".

I am a bit confused by the many scripts and Python files. Could you please write a guideline on how to run them? Thanks a lot :)
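For anyone else landing here before the README section is filled in: one common approach to ranking model outputs (not necessarily what LLMZoo's scripts do, so treat this as an assumption) is to collect pairwise judgments from a judge model and rank by win counts. A minimal, self-contained sketch with hypothetical model names:

```python
from collections import defaultdict

def rank_by_wins(comparisons):
    """Rank models by pairwise win counts.

    comparisons: list of (winner, loser) tuples, e.g. produced by
    asking a judge model which of two answers is better.
    Returns model names sorted from most to fewest wins.
    """
    wins = defaultdict(int)
    models = set()
    for winner, loser in comparisons:
        wins[winner] += 1
        models.add(winner)
        models.add(loser)
    return sorted(models, key=lambda m: wins[m], reverse=True)

# Hypothetical judge verdicts over three models:
verdicts = [
    ("phoenix", "alpaca"),
    ("phoenix", "vicuna"),
    ("vicuna", "alpaca"),
]
print(rank_by_wins(verdicts))  # → ['phoenix', 'vicuna', 'alpaca']
```

The actual LLMZoo pipeline presumably generates answers first and then scores them, so this only illustrates the final ranking step.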