Psycoy / MixEval

The official evaluation suite and dynamic data release for MixEval.
https://mixeval.github.io/
203 stars 29 forks source link

Question? #28

Closed aallsbury closed 1 month ago

aallsbury commented 1 month ago

Sorry guys, new to evals and benchmarks, I was hoping someone could point me in the right direction. I am currently trying to evaluate the baseline quality of several open-source model quants. I have them hosted via openai api (Oobabooga) at a url with an api key. Is there an easy way to alter this program to test my different models at the same api end point? I was able to alter MMLU-Pro to do this, but can't seem to figure it out with MixEval. Any help would be appreciated, thank you!

I am planning to test: Llama 3.1 70b Instruct EXL2 4.0bpw Mixtral 8x7b Instruct EXL2 6.0bpw Gemma 2 27B IT EXL2 8.0bpw

Thank you!

Psycoy commented 1 month ago

Hi,

Check out this issue, it might help!