Closed MrZhengXin closed 4 months ago
Sure, you can set it to 1024 or 2048 here: https://github.com/open-compass/opencompass/blob/889e7e11409d83fe312ecc7d7f0ed8861a84cc92/configs/datasets/subjective/multiround/mtbench_single_judge_diff_temp.py#L29 We set 512 because MT-Bench is a multi-turn dataset and we followed the original setting from early MT-Bench.
OK, sounds good. That said, their official length setting was later changed to 1024 as well.
OK, I will change it later, thanks.
Prerequisite
Type
I'm evaluating with the officially supported tasks/models/datasets.
Environment
Reproduces the problem - code/configuration sample
Reproduces the problem - command or script
Reproduces the problem - error message
Incomplete generation of Qwen2-72B-Instruct.
Other information
Hi! Thanks for the brilliant work on supporting so many evaluation datasets. As LLMs evolve, the output limit of 512 tokens in MT-Bench may no longer be suitable. Would changing it to 1024 or 2048 be more desirable?
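For anyone who wants to raise the limit locally before the default changes, here is a minimal sketch of the override discussed in this thread. It uses plain dicts rather than importing OpenCompass, and the key names (`infer_cfg`, `inferencer`, `max_out_len`) and the helper `with_max_out_len` are assumptions for illustration, so check them against the linked config file before relying on this.

```python
import copy


def with_max_out_len(dataset_cfg, max_out_len):
    """Return a copy of a dataset config dict with a new output-length limit.

    Hypothetical helper: the nested keys below are assumed, not taken
    from the OpenCompass source.
    """
    new_cfg = copy.deepcopy(dataset_cfg)  # leave the original config untouched
    new_cfg["infer_cfg"]["inferencer"]["max_out_len"] = max_out_len
    return new_cfg


# Stand-in for one MT-Bench dataset entry (assumed structure, for illustration).
mtbench_dataset = dict(
    abbr="mtbench",
    infer_cfg=dict(inferencer=dict(max_out_len=512)),
)

# Raise the limit from 512 to 1024, as suggested in the issue.
patched = with_max_out_len(mtbench_dataset, 1024)
print(patched["infer_cfg"]["inferencer"]["max_out_len"])
```

Copying the config instead of editing it in place keeps the 512 default available for comparison runs against the early MT-Bench setting.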