Closed Yangr116 closed 11 months ago
Hi Rui,
This is the logic as proposed by Q-Bench.
For Q-Instruct, as IQA is not the pre-trained goal for it, we follow this evaluation strategy.
Thanks for your quick reply. According to this design, does that mean the score token equals the eos token?
Not actually. This is actually the first token MLLM responses after the pre-set starting words.
Got it. Thanks!
Hi, thanks for your insightful work, I would like to know why are you use the last logits to calculate score?