I want to ask after we train and finetune the model, how do we actually object detection using raw image?
What preprocessing we have to do to feed the image into the network?
At the finetune network definition file,
the input_dim are 128, 12800, 1, 1.
What those dimensions represent respectively?
I understand that 128 is the batch_size, but can't figure out how other numbers come from.
If you can provide a demo to show how to use network to do detection as R-CNN,
that will be very helpful.
Hi,
I want to ask after we train and finetune the model, how do we actually object detection using raw image? What preprocessing we have to do to feed the image into the network?
At the finetune network definition file, the input_dim are 128, 12800, 1, 1. What those dimensions represent respectively? I understand that 128 is the batch_size, but can't figure out how other numbers come from.
If you can provide a demo to show how to use network to do detection as R-CNN, that will be very helpful.
Thanks.