Open Xpamile opened 3 years ago
When I fuse rgb and audio ,the Ap of your paper is 78.64%. But if I use three multimodal, the AP is worse than your paper. In principle, more modal fusion effects will be better,the fact is not. I am curious about this.
Dude, what kind of parameters can you run, can you share?
@Roc-Ng
When I fuse rgb and audio ,the Ap of your paper is 78.64%. But if I use three multimodal, the AP is worse than your paper. In principle, more modal fusion effects will be better,the fact is not. I am curious about this.