lm-sys / arena-hard-auto

Arena-Hard-Auto: An automatic LLM benchmark.
Apache License 2.0
316 stars 29 forks source link

Fix the order of questions.jsonl on Huggingface #3

Closed infwinston closed 2 months ago