Closed tbbbk closed 2 months ago
Thanks for the attention! We first calculate the QK attention scores and record the indexes, then crop the image based on the indexes, finally merge the cropped images with original image (adjust transparency) in the 'Visio'.
Hello! Thank u guys for such a amazing work. However, I am still confused about some details about the experiments:
Could you tell me how did you visualize these pictures.