yikaiw / CEN

[TPAMI 2023, NeurIPS 2020] Code release for "Deep Multimodal Fusion by Channel Exchanging"
MIT License
284 stars 43 forks source link

Question about CEN #12

Closed cjs123456 closed 2 years ago

cjs123456 commented 2 years ago

Hello, thank you very much for the code you shared, we used your algorithm to train the model (aerial RGB and elevation modality, using ImageNet pre-trained model), but during model validation we found that the scaling factors of the two modalities are very close, and with the deepening of the convolution, the two modalities scaling factors of all channels are almost the same, the effect of exchange is not obvious. May I ask where is the problem? Thanks in advance!

yikaiw commented 2 years ago

Hi, thanks for your interest. Have you applied the sparsity constraints (L1-norm) on the scaling factors? And the sparsity constraints should be applied individually on disjoint parts of channels (as shown in Figure 2).

cjs123456 commented 2 years ago

Thank you very much for the quick response! I’m sure the sparsity constraints (L1-norm) are applied on the scaling factors(as shown in Figure 2). But the result of the scaling factors visualization is shown in the following figure, two colors represent different modalities, which are very close image

yikaiw commented 2 years ago

In this figure, it seems the sparsity constraints have little effect on the scaling factors, which is abnormal. Have you tried to increase lambda (in Eq. (4))?

cjs123456 commented 2 years ago

Thank you very much for your great suggestion, The scaling factors visualization became reasonable after I tried to increase lamda. It seems like the sparsity constraints with higher lamda increases the difference of scaling factors between different modalities and highlights the non-negligible channels, which was discussed in detail in the paper [33](Learning Efficient Convolutional Networks through Network Slimming). I also want to ask a question, have you tried using the ratio of the scaling factors to decide whether to exchange channels? i.e. use ratio instead of threshold, such as exchange channels if the ratio of the scaling factors is higher than 0.2. I did some experiments like this but it didn't work very well, may I ask what is the problem? Thank you very much!

yikaiw commented 2 years ago

Hi, during our early exploration, we also tried to exchange channels by using ratios instead of the threshold. Yet we find the current threshold-guided design is much better than ratio-guided one.

cjs123456 commented 2 years ago

Thank you very much for your reply!