lm-sys / arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.
Apache License 2.0
316 stars 29 forks source link

Update gen_judgment.py #24

Closed r4dm closed 1 month ago

r4dm commented 1 month ago

fix for a local model to work as a judge

[(https://github.com/lm-sys/arena-hard-auto/issues/20)]

CodingWithTim commented 1 month ago

Thanks!