Closed by homelifes 4 years ago
The trainval2014_resnet101_faster_rcnn_genome file is for VQA fine-tuning, not for pre-training, so it does not need the labels used for masked RoI classification. Our pre-training is conducted on the Conceptual Captions dataset, together with a text-only corpus.
Sorry, missed that! Thanks a lot for your reply
Hello. Thanks for sharing your code. I downloaded the trainval2014_resnet101_faster_rcnn_genome file from the Google Drive link. Inside it I could only find image_id, image_w, image_h, num_boxes, boxes, features. But I can't find the predicted category for each box, which you use in your masked RoI classification. Could you tell me where these are located?
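For anyone who wants to check the fields themselves, here is a minimal sketch of reading such a file. It assumes the file is a tab-separated table with the six columns listed above, and that boxes and features are base64-encoded float32 arrays (the convention used by the bottom-up-attention feature dumps). That layout is an assumption, not something confirmed in this thread, and the names decode_row and read_rows are hypothetical helpers.

```python
import base64
import csv

import numpy as np

# Column order as listed in the question above; assumed to match the file.
FIELDNAMES = ["image_id", "image_w", "image_h", "num_boxes", "boxes", "features"]


def decode_row(row):
    """Turn one raw TSV row (dict of strings) into ints and NumPy arrays."""
    num_boxes = int(row["num_boxes"])
    # Assumption: both arrays are base64-encoded raw float32 buffers.
    boxes = np.frombuffer(base64.b64decode(row["boxes"]), dtype=np.float32)
    features = np.frombuffer(base64.b64decode(row["features"]), dtype=np.float32)
    return {
        "image_id": int(row["image_id"]),
        "image_w": int(row["image_w"]),
        "image_h": int(row["image_h"]),
        "num_boxes": num_boxes,
        "boxes": boxes.reshape(num_boxes, 4),        # one (x1, y1, x2, y2) per box
        "features": features.reshape(num_boxes, -1),  # one feature vector per box
    }


def read_rows(handle):
    """Yield decoded rows from an open text handle on the TSV file."""
    # The encoded feature column is far longer than the csv module's default limit.
    csv.field_size_limit(2**31 - 1)
    reader = csv.DictReader(handle, delimiter="\t", fieldnames=FIELDNAMES)
    for row in reader:
        yield decode_row(row)
```

Opening the file in text mode and calling next(read_rows(f)) returns a dict with exactly these six keys. None of them holds a per-box class label.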