I need to have the option to evaluate the Benchmark with an Open Source Model as LLM-Judge.
How Can I do that, if this is note possible shall we work on a PR?
I have started a PR: https://github.com/Psycoy/MixEval/pull/46
Reason for this issue...We might face:
HI @Psycoy
I need to have the option to evaluate the Benchmark with an Open Source Model as LLM-Judge.
How Can I do that, if this is note possible shall we work on a PR?I have started a PR: https://github.com/Psycoy/MixEval/pull/46 Reason for this issue...We might face:regards Carsten