Closed mukeshnarendran7 closed 2 years ago
Thanks for getting back
Hi,
d
to keypoint_number
. This 1x1 conv equals a linear FC, because it is position-wise linear transformation. So you can also use a 1x1 conv with (d, 16) channels. They have the same effect to output the heatmaps.Hi, thanks once again for clarifying the issues. I have some more questions about processing.
I want to use the pre-trained model and fine-tune it for another application but I am not able to find the heatmaps preparation code reference? Is it similar to taking an image an converting the (x,y) co-ordinates to heatmaps like for CNN's pose estimation problem? The model output is of (48,64) but my input images are 256, 192. A reference will be helpful. Thanks