Closed mbchang closed 6 months ago
Thank you for bringing this to our attention.
The examples you've pointed out, including question 699, originate from the source dataset, KVQA. This dataset occasionally includes questions with ambiguous information or unclear instructions, particularly regarding the calculation of age gaps.
The confusion primarily stems from the perspective of "left" and "right" in the image, as well as the lack of clear instructions on how to calculate the age gap. The ground truth answer of "0" for question 699 indeed suggests a perspective based on the direction the subjects are facing, rather than the viewer's perspective.
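To make the ambiguity concrete, here is a minimal sketch using made-up data. The names, positions, and birth years are hypothetical and not taken from KVQA. It also assumes the subjects face the camera, so their left and right mirror the viewer's, and that the age gap is the absolute difference in birth years. Under those assumptions the same question gives two different answers:

```python
# Hypothetical people listed in viewer order, left to right across the image.
# Birth years are invented for illustration only (not from KVQA).
people_viewer_order = [
    ("A", 1962),
    ("B", 1962),
    ("C", 1950),
]

def age_gap(people, perspective="viewer"):
    """Age gap in years between the center and the rightmost person.

    perspective="viewer":  rightmost as seen by someone looking at the image.
    perspective="subject": rightmost from the subjects' own point of view,
                           assumed to be the mirror of the viewer's ordering.
    """
    if perspective not in ("viewer", "subject"):
        raise ValueError("perspective must be 'viewer' or 'subject'")
    ordered = people if perspective == "viewer" else list(reversed(people))
    center = ordered[len(ordered) // 2]
    rightmost = ordered[-1]
    # Assumption: the gap is the absolute difference of birth years.
    return abs(center[1] - rightmost[1])

viewer_gap = age_gap(people_viewer_order, "viewer")
subject_gap = age_gap(people_viewer_order, "subject")
print(viewer_gap, subject_gap)
```

With this invented data the viewer's reading gives 12 and the subjects' reading gives 0, so a ground truth of "0" is only reachable under one of the two conventions.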
Considering the need for consistent evaluation of current models, we chose to keep the raw format of the examples as they appear in the source dataset for now. However, we greatly appreciate your valuable suggestions and recognize the importance of more precise and unambiguous wording in these questions.
Question 699 asks:
What is the age gap between the center and the rightmost person? (unit: years)
Context:
Issues with this question:
Questions 614, 367, 311, 398, 405, 518, 70, 208, 317, 946, 741, 745, 381, 473, 158, 41, 792, 845, 864, 988, 830, 795, 299, 240, 859, 838, 42, 788, 417, 313, 433, 126, 428, 366, 680, 60, 36, 590, 53, 960, 261, 27, 503, 699, 438, 29, 115, 500, 945 are also "age gap" questions that have similar issues.
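One way to work with this list without changing the dataset is to report accuracy separately for the flagged questions. The sketch below is only an illustration: `accuracy_split`, the shape of `results` (question ID mapped to a correct/incorrect flag), and the sample results are all hypothetical, and only a few of the IDs listed above are included.

```python
# A few of the "age gap" question IDs listed above; extend with the full list.
FLAGGED_IDS = {699, 614, 367, 311}

def accuracy_split(results, flagged_ids):
    """Return accuracy over all, flagged, and unflagged questions.

    results: dict mapping question ID (int) to True/False (answered correctly).
    A group with no questions gets None instead of an accuracy.
    """
    def acc(values):
        values = list(values)
        return sum(values) / len(values) if values else None

    flagged = [ok for qid, ok in results.items() if qid in flagged_ids]
    unflagged = [ok for qid, ok in results.items() if qid not in flagged_ids]
    return {
        "all": acc(results.values()),
        "flagged": acc(flagged),
        "unflagged": acc(unflagged),
    }

# Invented per-question results, for illustration only.
example_results = {699: False, 614: True, 1: True, 2: True}
split = accuracy_split(example_results, FLAGGED_IDS)
print(split)
```

Reporting both numbers keeps results comparable with the raw dataset while showing how much the ambiguous questions affect a score.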