fabienbaradel / object_level_visual_reasoning

Pytorch Implementation of "Object level Visual Reasoning in Videos", F. Baradel, N. Neverova, C. Wolf, J. Mille, G. Mori , ECCV 2018
172 stars 20 forks source link

how to train mask-r-cnn #5

Open JackSparrow3 opened 5 years ago

JackSparrow3 commented 5 years ago

Hi Fabien, thank you for your works! the paper doesn't mention how to train a mask-r-cnn since VLOG dataset does has any mask annotation, i wonder if your team has annotate the mask by yourself or use other open datasets to train mask-r-cnn? thanks again!

fabienbaradel commented 5 years ago

Hi @JackSparrow3 Thanks :) You can find the annotations here: https://fabienbaradel.github.io/masks_data/ We have used MaskRCNN pretrained on COCO. We did not train MaskRCNN on our datasets (VLOG, Something Something, EPIC) because we do not have access to object labels. We have only used the prediction made by Maskrcnn. I hope it is clear now. In case you want to train MaskRCNN you can have a look at this repo: https://github.com/facebookresearch/maskrcnn-benchmark

Fabien