wangf3014 / SCLIP

Official implementation of SCLIP: Rethinking Self-Attention for Dense Vision-Language Inference
127 stars 9 forks source link

How to visualize the attention map #10

Open jiinhui opened 9 months ago

jiinhui commented 9 months ago

Hello, I ask a simple question here. In your paper, you visualize the final layer attention maps of vanilla CLIP in Figure 2. Can you tell me how to do that specificly ?Or which tool have you used? Thanks