I thought the COCO dataset has 5 captions per image.
But the *.json files show that some images have more than 5 captions. Is this normal?
To reproduce:
    import json

    with open('data/lxmert/mscoco_minival.json') as f:
        data = json.load(f)

    # Print every image that has more than 5 COCO captions
    for datum in data:
        coco_sents = datum['sentf']['mscoco']
        if len(coco_sents) > 5:
            print(datum['img_id'])
            print(coco_sents)
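To see how common this is, rather than just which images are affected, a tally of captions per image might help. This is only a minimal sketch: `caption_count_histogram` is a hypothetical helper name, and it assumes every entry uses the same `sentf` / `mscoco` layout as in the snippet above. The sample list is made up for illustration and is not real data.

```python
from collections import Counter


def caption_count_histogram(data):
    """Map number of COCO captions -> number of images with that many."""
    counts = Counter()
    for datum in data:
        # Assumption: entries lacking 'sentf' or 'mscoco' count as 0 captions.
        coco_sents = datum.get('sentf', {}).get('mscoco', [])
        counts[len(coco_sents)] += 1
    # Sort by caption count so the result is easy to read.
    return dict(sorted(counts.items()))


# Tiny made-up sample in the assumed layout (not real data).
sample = [
    {'img_id': 'a', 'sentf': {'mscoco': ['c'] * 5}},
    {'img_id': 'b', 'sentf': {'mscoco': ['c'] * 6}},
    {'img_id': 'c', 'sentf': {'mscoco': ['c'] * 5}},
]
print(caption_count_histogram(sample))  # {5: 2, 6: 1}
```

On the real file, the list returned by `json.load` for `data/lxmert/mscoco_minival.json` could be passed in place of `sample`.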