Open mustafaadogan opened 11 months ago
First, I utilize GPT-4 to transform the appropriate questions and answers related to Dimension 3 (Instance Attribute) and Dimension 5 (Instance Counting) into declarative statements. Next, I divide the attributes into 20 categories using GPT-4 and allow it to classify the corresponding declarative sentences for each attribute. Lastly, I apply filtering within each category to obtain the final set of questions for in-context captioning.
Hello! First and foremost, I'd like to congratulate you on this incredible work. I have a question regarding the creation of the dataset for In-Context Captioning and Interleaved Image-Text Analysis dimensions. How were the in-context examples chosen during this process?