Hi, I am confused about this. The output feature of HRNet-W32 has 32 channels but the targe maps has more than 51 channel (17 keypoint hmps + 17*2 offset x, y maps).
Usually, the target feature maps have more than 4~5 times the channel number of the output feature maps of the backbone network.
Hi, I am confused about this. The output feature of HRNet-W32 has 32 channels but the targe maps has more than 51 channel (17 keypoint hmps + 17*2 offset x, y maps). Usually, the target feature maps have more than 4~5 times the channel number of the output feature maps of the backbone network.