Thanks to the author for the code. I have two small questions:
Regarding this small block of code below, used to obtain the CAM:
def forward_cam(self, x):
    x = super().forward(x)
    x = self.fc8(x)
    x = F.relu(x)
    x = torch.sqrt(x)
    return x
What is x = torch.sqrt(x) for?
Regarding the pretrained Caffe weights, i.e. vgg16_20M.prototxt: on what dataset was it trained? Did you mean to use the VGG16 version of the DeepLab-LargeFOV model as the network to compute CAM, and if so, why set fc6_dilation=1 rather than 12 as in the DeepLab v1 paper?
The VGG16_20M model was trained on ImageNet, as the DeepLab v1 paper says.
I just wanted to adopt DeepLab for CAM and AffinityNet because it will be used for semantic segmentation anyway.
I think the other questions are closely related to this. To convert DeepLab into a CAM network, I followed https://arxiv.org/pdf/1701.08261.pdf (which includes removing the last max-pooling layer, adjusting the dilation rate, etc.).
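For readers following along, here is a minimal sketch of what such a conversion might look like. It is not the repository's actual code: the layer names (fc6/fc7/fc8), the fc6_dilation argument, and the use of torchvision's VGG16 trunk are assumptions made for illustration only.

import torch
import torch.nn as nn
import torch.nn.functional as F
from torchvision import models

class VGG16CAM(nn.Module):
    # Hypothetical sketch: turning a VGG16 / DeepLab-LargeFOV-style backbone
    # into a CAM network, roughly in the spirit of the paper linked above.
    def __init__(self, num_classes=20, fc6_dilation=1):
        super().__init__()
        vgg = models.vgg16(weights=None)
        # Drop the last max-pooling layer so the class activation maps
        # keep a higher spatial resolution.
        self.features = nn.Sequential(*list(vgg.features.children())[:-1])
        # fc6 becomes a dilated 3x3 convolution; the dilation rate is a knob
        # (1 as asked about above, 12 in the DeepLab-LargeFOV setting).
        self.fc6 = nn.Conv2d(512, 1024, kernel_size=3,
                             padding=fc6_dilation, dilation=fc6_dilation)
        self.fc7 = nn.Conv2d(1024, 1024, kernel_size=1)
        # fc8 produces one activation map per class (the raw CAMs).
        self.fc8 = nn.Conv2d(1024, num_classes, kernel_size=1, bias=False)

    def forward_cam(self, x):
        x = self.features(x)
        x = F.relu(self.fc6(x))
        x = F.relu(self.fc7(x))
        x = self.fc8(x)
        x = F.relu(x)
        x = torch.sqrt(x)  # variance compensation, discussed below
        return x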
Then I found that, with the last max-pooling layer removed, the variance of the activation map became different; this can be simply compensated by x = torch.sqrt(x). (This is not necessary when trying other backbone networks such as VGG-GAP, https://github.com/metalbubble/CAM.)
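As a quick toy illustration of the variance point (my own example, not from the repository): an elementwise square root compresses the spread of non-negative activations, which is what the compensation relies on.

import torch

torch.manual_seed(0)
# Toy non-negative "activation map"; the larger scale stands in for the
# higher variance seen after removing the last max-pooling layer.
cam = torch.relu(torch.randn(1, 20, 32, 32) * 3.0)

print(cam.std().item())              # spread before compensation
print(torch.sqrt(cam).std().item())  # smaller spread after x = torch.sqrt(x)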