RefineNet - Githubissues

JunkyByte / deepcharuco

Unofficial pytorch implementation of the model proposed in Deep ChArUco: Dark ChArUco Marker Pose Estimation CVPR2019 https://arxiv.org/abs/1812.03247 for ChArUco board localization.

MIT License

33 stars 7 forks source link

Hi @arsalanshakeel thanks for your interest.

I have opted for upsampling because the architecture they presented for RefineNet is not clear to me.

Yes it seems that the architecture should be the same up to the heads, but if you input a (24,24) patch into that VGG based backbone you have a (C, 24/8, 24/8) output from the last conv. How the head should be designed to output a 4096D vector starting from this (C, 3, 3) is a mystery to me.

That's why I opted for upsampling: I remove some border pixels using padding=0 in the first few convs to obtain a (C, 16, 16) tensor. I apply max pool to (C, 8, 8) and continue by 2x upsampling 3 times to obtain a (1, 64, 64) as output.

If you have a valid idea on how to design the RefineNet feel free to discuss it

JunkyByte / deepcharuco

RefineNet #1