ttpro1995 closed 5 years ago
Because there are three pooling layers.
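To make the arithmetic concrete, here is a minimal sketch, assuming each pooling layer is a 2x2 max pool with stride 2 and no padding (the usual VGG-style setup; the thread does not state the exact configuration). The function name `pooled_size` and the 768x1024 example size are only illustrative.

```python
def pooled_size(height, width, n_pools=3, stride=2):
    """Spatial size after n_pools pooling layers (assumed stride 2, no padding)."""
    for _ in range(n_pools):
        # Each pooling layer divides both dimensions by the stride (floor).
        height //= stride
        width //= stride
    return height, width


# Three pooling layers give a total reduction factor of 2 ** 3 = 8.
out_h, out_w = pooled_size(768, 1024)
print(out_h, out_w)
```

So under these assumptions the density map is 1/8 of the input along each side, which is 1/64 of the pixels.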
I mean, the author said "we choose bilinear interpolation". Does that mean bilinear interpolation is a layer in the model, so that the output of the last layer has the same size as the input image?
The bilinear interpolation is not part of the model; the last trainable layers are the strided convolutions. You can add it as a post-processing step, but the count has to be calculated before the upsampling.
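A rough sketch of why the count should be taken first: interpolation creates new pixels, so the sum of the upsampled map no longer equals the predicted count. The code below is a hand-written bilinear resize in plain Python (corner-aligned sampling is my assumption, and `bilinear_upsample`, `total` and the toy 2x2 map are hypothetical names and data, not taken from the repository).

```python
def total(density_map):
    """Sum of all pixels, i.e. the estimated count."""
    return sum(sum(row) for row in density_map)


def bilinear_upsample(dm, factor):
    """Upsample a 2D list by `factor` with corner-aligned bilinear interpolation."""
    h, w = len(dm), len(dm[0])
    out_h, out_w = h * factor, w * factor
    out = []
    for i in range(out_h):
        # Map the output row back to a fractional source row.
        y = i * (h - 1) / (out_h - 1) if out_h > 1 else 0.0
        y0 = int(y)
        y1 = min(y0 + 1, h - 1)
        fy = y - y0
        row = []
        for j in range(out_w):
            x = j * (w - 1) / (out_w - 1) if out_w > 1 else 0.0
            x0 = int(x)
            x1 = min(x0 + 1, w - 1)
            fx = x - x0
            top = dm[y0][x0] * (1 - fx) + dm[y0][x1] * fx
            bottom = dm[y1][x0] * (1 - fx) + dm[y1][x1] * fx
            row.append(top * (1 - fy) + bottom * fy)
        out.append(row)
    return out


# Toy "network output": the count is read from the low-resolution map.
low_res = [[0.0, 1.0], [2.0, 0.0]]
count = total(low_res)

# Upsampling by 8 for visualisation inflates the sum.
up = bilinear_upsample(low_res, 8)
inflated = total(up)

# If the full-size map must still sum to the count, rescale it afterwards.
scale = count / inflated if inflated else 0.0
rescaled = [[v * scale for v in row] for row in up]
print(count, inflated, total(rescaled))
```

In other words, either read the count from the raw output, or renormalise the upsampled map so that it sums to that count again.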
Yeah, I think @Pipoderoso is right. I coded it in Keras, added an Upsampling layer, and wondered why the results did not get better (the layer is not trained).
When I checked the PyTorch code carefully, I noticed that the output is 1/8 the size of the input.
I ran the train_loader, which is created in train.py.
The result shows that the resolution of the ground-truth density map is reduced to 1/8 of the input image.
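For anyone who wants to build a matching 1/8 ground truth themselves, here is one possible sketch. It sums each 8x8 block so that the total count is preserved. This is a swapped-in technique for illustration and not necessarily what the repository's data loader does; `downsample_density` and the constant test map are hypothetical.

```python
def downsample_density(dm, factor=8):
    """Reduce a 2D density map by `factor` per side by summing each block.

    Summing (rather than averaging) keeps the total count unchanged.
    Rows and columns that do not fill a whole block are dropped.
    """
    h, w = len(dm), len(dm[0])
    out_h, out_w = h // factor, w // factor
    out = [[0.0] * out_w for _ in range(out_h)]
    for i in range(out_h * factor):
        for j in range(out_w * factor):
            out[i // factor][j // factor] += dm[i][j]
    return out


# A 16x24 map where every pixel holds 0.5 "people".
full_res = [[0.5] * 24 for _ in range(16)]
small = downsample_density(full_res)

full_count = sum(sum(row) for row in full_res)
small_count = sum(sum(row) for row in small)
print(len(small), len(small[0]), full_count, small_count)
```

If the map is instead shrunk with an ordinary image resize, the values are averaged rather than summed, so the result would need to be multiplied by roughly 64 to keep the same count.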
As the author mentions in the paper: