DS-1000 evaluation - Githubissues

nlpxucan / WizardLM

LLMs build upon Evol Insturct: WizardLM, WizardCoder, WizardMath

9.11k stars 711 forks source link

DS-1000 evaluation #205

Open TahaBinhuraib opened 10 months ago

TahaBinhuraib commented 10 months ago

The paper presented evaluations on the ds-1000 benchmark, but I can't find a script to reproduce the results. It would be quite helpful if you could provide the code for evaluating these models on the ds1000 benchmark.