I'm wondering if you've tried generating heatmap of the attention weight before, like Grad-CAM. For example, after I get the attention weight of each attention block and the corresponding sourcematrix, is there any way to generate heatmap with ToMelike the following image:
Hello, Thanks for this amazing work!
I'm wondering if you've tried generating heatmap of the
attention weight
before, likeGrad-CAM
. For example, after I get theattention weight
of each attention block and the correspondingsource
matrix, is there any way to generate heatmap withToMe
like the following image:Looking forward to your reply :)
Best