peteanderson80 / bottom-up-attention

Bottom-up attention model for image captioning and VQA, based on Faster R-CNN and Visual Genome
http://panderson.me/up-down-attention/
MIT License
1.43k stars 378 forks source link

run detection on my own images with pre-trained resnet model #57

Open tuyunbin opened 5 years ago

tuyunbin commented 5 years ago

Hi, I want to detect my own pictures with the resent-101 model trained by you on visual genome. I found several demo. py files in the 'tool' folder. Does demo_vg.py be used to perform detection results with the VGG model trained on genome? How do I modify this file if I want to switch to the pre-trained resnet 101 model?

a1391651300 commented 5 years ago

hello,could you please tell me how to download the pre-trained resnet101 faster rcnn model ? It seems that the download link is wrong

AlonzoLeeeooo commented 1 year ago

Hi @tuyunbin, have you solve this problem yet?