Hi, Zhipeng. I have an issue about the Figure2(c) in the original paper. How to perform Grad-CAM for each frame in a video? For the video models, such as NL-101, NL-50,TPN-101,TPN-50, the temporal dimension will decrease (32->16->8->4...) as the network forward. Thank you for your reply.
Hi, Zhipeng. I have an issue about the Figure2(c) in the original paper. How to perform Grad-CAM for each frame in a video? For the video models, such as NL-101, NL-50,TPN-101,TPN-50, the temporal dimension will decrease (32->16->8->4...) as the network forward. Thank you for your reply.