TouqeerAhmad opened this issue 4 years ago
I think the way Caffe is written, each layer's Reshape() method is called exactly once during layer construction (see the Setup() method in layer.hpp), and afterwards the shape of the blobs remains the same throughout training/testing. So by default, you can't feed different input sizes for different images.
With that said, it is possible to implement your own layer that calls Reshape() inside each forward call. I am not 100% certain, but I suspect this could negatively impact network speed.
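As a rough illustration, with pycaffe you can reshape the input blob to each image's size before the forward pass and let the new shapes propagate through the network, which works as long as the network is fully convolutional. This is only a minimal sketch; the `data` blob name and the deploy/weights paths are placeholders for your own model:

```python
import caffe

# Placeholder paths and blob name; substitute your own model files.
net = caffe.Net('deploy.prototxt', 'weights.caffemodel', caffe.TEST)

def forward_variable_size(net, image):
    """Run inference on an image of arbitrary H x W by reshaping the input blob."""
    h, w = image.shape[:2]
    # Resize the input blob to this image, then propagate the new shapes
    # through every layer before running the forward pass.
    net.blobs['data'].reshape(1, 3, h, w)
    net.reshape()
    # HWC -> CHW, single-image batch.
    net.blobs['data'].data[0] = image.transpose(2, 0, 1)
    return net.forward()
```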
Hi, thank you for sharing the code and trained models! I have a question specific to the demo in your PyTorch implementation. As I understand it, any incoming image is resized to base_size = 512 regardless of its original dimensions, while maintaining the aspect ratio; inference is then run in a grid fashion on 473x473 crops.
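For reference, here is a minimal sketch of the grid inference I am describing, not your actual code; `grid_inference`, `stride_rate`, and `num_classes` are illustrative assumptions, and the model is assumed to return per-pixel logits at crop resolution:

```python
import math
import torch
import torch.nn.functional as F

def grid_inference(model, image, crop_size=473, stride_rate=2/3, num_classes=21):
    """Sliding-window ("grid") inference over an image already resized to base_size.
    image: float tensor of shape (1, 3, H, W); model(crop) returns (1, C, crop, crop) logits."""
    _, _, h, w = image.shape
    stride = int(math.ceil(crop_size * stride_rate))
    # Pad so the image is at least crop_size in each dimension.
    pad_h, pad_w = max(crop_size - h, 0), max(crop_size - w, 0)
    image = F.pad(image, (0, pad_w, 0, pad_h))
    _, _, H, W = image.shape
    logits = torch.zeros(1, num_classes, H, W)
    counts = torch.zeros(1, 1, H, W)
    rows = int(math.ceil((H - crop_size) / stride)) + 1
    cols = int(math.ceil((W - crop_size) / stride)) + 1
    for r in range(rows):
        for c in range(cols):
            y0 = min(r * stride, H - crop_size)
            x0 = min(c * stride, W - crop_size)
            crop = image[:, :, y0:y0 + crop_size, x0:x0 + crop_size]
            with torch.no_grad():
                out = model(crop)
            # Accumulate logits and visit counts so overlapping crops are averaged.
            logits[:, :, y0:y0 + crop_size, x0:x0 + crop_size] += out
            counts[:, :, y0:y0 + crop_size, x0:x0 + crop_size] += 1
    logits = logits / counts
    return logits[:, :, :h, :w]  # drop the padded border
```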
My question is: why have a fixed crop_size or base_size at all? There is no fully connected layer in the architecture, so why do we need a fixed size? Is this an arbitrary choice of numbers, or is there a solid reason for it?
Thank you for your time! Best, Touqeer