Question about Audio_only and Visual_only accuracy

GeWu-Lab / OGM-GE_CVPR2022

The repo for "Balanced Multimodal Learning via On-the-fly Gradient Modulation", CVPR 2022 (ORAL)

MIT License


Question about Audio_only and Visual_only accuracy #32

Open yuquanqingyin opened 1 year ago

yuquanqingyin commented 1 year ago

How were the Audio-only and Visual-only results in Table 1 of the paper obtained? I assume they are not the acc_a and acc_v values in the code?

echo0409 commented 1 year ago

Hi, thanks for your interest. We describe the process in a footnote of our paper:

"Here, the visual-only and the audio-only are the models that trained with a single modality. To evaluate the uni-modal encoders of the multimodal model, we observe the performance through fixing the joint-trained uni-modal encoder and finetuning a uni-modal classifier only."
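The second half of that footnote (fix the jointly trained encoder, fine-tune only a uni-modal classifier) could be sketched roughly as below. This is a minimal illustration under stated assumptions, not the authors' evaluation script: the function names `freeze`, `finetune_unimodal_classifier`, and `accuracy` are hypothetical, the encoder is assumed to map a batch to a `(batch, feat_dim)` feature tensor, and the optimizer, learning rate, and epoch count are placeholders rather than the paper's hyperparameters.

```python
import torch
import torch.nn as nn


def freeze(module):
    """Disable gradients for every parameter and switch the module to eval mode."""
    for p in module.parameters():
        p.requires_grad = False
    module.eval()
    return module


def finetune_unimodal_classifier(encoder, feat_dim, n_classes, loader,
                                 epochs=1, lr=1e-3):
    """Train a fresh linear classifier on top of a frozen uni-modal encoder.

    Assumption: `loader` yields (input, label) pairs for ONE modality, and
    `encoder(input)` returns features of shape (batch, feat_dim).
    """
    freeze(encoder)
    classifier = nn.Linear(feat_dim, n_classes)
    # Only the classifier's parameters are handed to the optimizer.
    optimizer = torch.optim.SGD(classifier.parameters(), lr=lr, momentum=0.9)
    criterion = nn.CrossEntropyLoss()
    for _ in range(epochs):
        for x, y in loader:
            with torch.no_grad():  # the encoder stays fixed
                feat = encoder(x)
            loss = criterion(classifier(feat), y)
            optimizer.zero_grad()
            loss.backward()
            optimizer.step()
    return classifier


@torch.no_grad()
def accuracy(encoder, classifier, loader):
    """Top-1 accuracy of the frozen encoder plus the fine-tuned classifier."""
    encoder.eval()
    classifier.eval()
    correct, total = 0, 0
    for x, y in loader:
        pred = classifier(encoder(x)).argmax(dim=1)
        correct += (pred == y).sum().item()
        total += y.numel()
    return correct / total
```

Under these assumptions, one would load the jointly trained checkpoint, pass its audio (or visual) encoder to `finetune_unimodal_classifier` with that modality's training loader, and report `accuracy` on the test loader. The "audio-only" and "visual-only" rows described in the first half of the footnote would instead come from training a single-modality model from scratch.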

yuquanqingyin commented 1 year ago

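Could you provide the code and hyperparameters for Audio-only and Visual-only? My runs give Audio-only: 60% and Visual-only: 20%, which differ considerably from the numbers in Table 1 of the paper.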

HPU-Yz commented 1 year ago

Hello, I have run into the same problem. Could you provide the code for Audio-only and Visual-only? Thank you.

HPU-Yz commented 1 year ago

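"Could you provide the code and hyperparameters for Audio-only and Visual-only? My runs give Audio-only: 60% and Visual-only: 20%, which differ considerably from the numbers in Table 1 of the paper."

Did you ever get them, or would you be able to share what you have? Thanks.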