Open ning-mi opened 2 years ago
Hi! The VIST dataset provides five stories and three captions for each image, but I only need one of each. Could you tell me how to select the story and the caption from the dataset? Is it a random selection?