we would like to use densecap to predict caption for already computed bounding boxes.
I tried using the im_detect() function in /lib/fast_rcnn/test.py which has a boxes argument.
I would expect the function to output the same number of boxes as i put in, which does not happen. Instead, it looks like the network predicts new boxes in the RPN.
I tried setting the cfg.TEST.HAS_RPN parameter to False in order to load the rois blobs in the _get_blobs() function -> The boxes are loaded into blobs, but this has no effect on the outcome. Are they used at all in this case?
Do i need to adjust the feature_net (vgg_region_global_feature.prototxt) in some way or set some other parameters in order for the network to work as expected? Or did I miss something else?
Hi,
we would like to use densecap to predict caption for already computed bounding boxes. I tried using the
im_detect()
function in/lib/fast_rcnn/test.py
which has aboxes
argument.I would expect the function to output the same number of boxes as i put in, which does not happen. Instead, it looks like the network predicts new boxes in the RPN.
I tried setting the
cfg.TEST.HAS_RPN
parameter toFalse
in order to load the rois blobs in the_get_blobs()
function -> The boxes are loaded intoblobs
, but this has no effect on the outcome. Are they used at all in this case?Do i need to adjust the feature_net (vgg_region_global_feature.prototxt) in some way or set some other parameters in order for the network to work as expected? Or did I miss something else?
Thanks