Open lucasjinreal opened 2 months ago
Hi, the max_new_tokens
we sets in cmmmu is 16 since it should only output answer choices such as A, B, C, D
. If you want to do generation on it, you can set this parameter higher
Hi, looks like my small models hard to follow the instructing precisely. Am just curious if it possible to edit the prompt in question, such like: 请回答下列选择题,请直接回答选项字母。
or Answer with the option's letter from the given choices directly.
just like VLMEvalKit does?
For some of the tasks, we have implemented the model_specific_kwargs
but sadly this is not included in the cmmmu task.
For now you can try to hardcode your prompt in this function
OK, I changed the prompt into:
# "task_instructions": [
# "请回答以下多项选择题,并选出正确选项。这些题目可能包括单选和多选题型。如果所提供的信息不足以确定一个明确的答案,那么请根据可用的数据和你的判断来选择最可能正确的选项。",
# "请回答以下判断题,并根据题目描述和所给的信息来判断问题中陈述的对错。如果信息不完整或不足以作出绝对判断,请运用你的逻辑推理和现有信息来做出最可能的判断。",
# "请回答以下填空题,并根据题目的要求和所提供的信息来给出最恰当的答案。如果信息不足以确切回答,那么请依据现有的数据和你的推理能力来填写最合理的答案。",
# ],
"task_instructions": [
"请回答以下多项选择题,并选出正确选项。你只需要回答正确选项对应的字母。可能为单选也可能为多选。",
"请回答以下判断题,仅需要回答对或者错。",
"请回答以下填空题,填写空白处正确的内容。",
],
It boost my model performance on CMMMU by 2 points....
生成的结果中,有很多这样的截断:
实际推理这张图片的时候是可以完整输出的,这种原因是为啥?