Zhaozixiang1228 / MMIF-CDDFuse

[CVPR 2023] Official implementation for "CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion."
382 stars 37 forks source link

attention scores #30

Closed 123456gwp closed 10 months ago

123456gwp commented 10 months ago

Excus me, in your code,how are attention scores calculated between different patches of the same image in the attention part of your network structure? My understanding is that during the image preprocessing stage, you segmented the image into 128x128 patches, and when loading the image, the shape is (8, 1, 128, 128). Does this mean that only one patch is loaded? Then, using the OverlapPatchEmbed function, did you change the number of channels of this patch to 64 and subsequently calculate attention scores between channels?

Zhaozixiang1228 commented 10 months ago

yeah, you are right.