lihong2303 / AGM

[ICCV2023] The repo for "Boosting Multi-modal Model Performance with Adaptive Gradient Modulation".
MIT License
22 stars 2 forks source link

Parameter settings of CREMA-D #5

Open nihaoxiaoli opened 8 months ago

nihaoxiaoli commented 8 months ago

May I ask what is the parameter setting for this dataset CREMA-D to reproduce the results in the paper? Additionally, what is the setting of a single mode that enables visual mode to reach 75.93? Looking forward to your reply.

lihong2303 commented 8 months ago

The modulation strength alpha is 1.5 and the end epoch is 1500. Other parameters can be found in the paper. For visual mono-modal concept, you can find it in the paper for computing mono-modal concept in late fusion. It is a different case for different fusion strategies.