Closed llyx97 closed 7 months ago
Thanks for your attention on our work. We use the instruction of "Please answer yes or no."
Thanks for the response.
However, the MLLMs may not strictly follow the instructions to answer "yes" or "no". For example, the MLLM may respond "The photo is taken inside a greenhouse, ..." (taken from Fig.4 in the MME paper). Could you explain more on how to convert such responses into Y/N labels?
Best regards,
In such a case, the model can not follow the simple instruction, and thus we judge the model delivers a wrong answer. Thank you.
Got it. Many thanks.
Hi, Thanks for sharing the great work! I have a question regarding the conversion of MLLM's responses into Y/N labels. Could you please provide more details on how this conversion process is implemented in MME?