-
Hi,
In the paper on Gated Multimodal Fusion, you use a slightly different formula from the one in the code. For example, the concatenation between img_new_resize and tweet_new_resize became a sum in t…
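For reference, a minimal PyTorch sketch of a gated multimodal unit showing the two variants in question: computing the gate from the concatenation of the two features (as in the paper) versus from their sum (as apparently in the code). The names img_new_resize / tweet_new_resize come from the question; the layer sizes and everything else are illustrative assumptions.

import torch
import torch.nn as nn

class GatedMultimodalUnit(nn.Module):
    # Illustrative gated fusion; dimensions are assumptions.
    def __init__(self, dim, gate_from="concat"):
        super().__init__()
        self.proj_img = nn.Linear(dim, dim)
        self.proj_txt = nn.Linear(dim, dim)
        self.gate_from = gate_from
        gate_in_dim = 2 * dim if gate_from == "concat" else dim
        self.gate = nn.Linear(gate_in_dim, dim)

    def forward(self, img_new_resize, tweet_new_resize):
        h_img = torch.tanh(self.proj_img(img_new_resize))
        h_txt = torch.tanh(self.proj_txt(tweet_new_resize))
        if self.gate_from == "concat":  # paper: gate from [img; tweet]
            gate_in = torch.cat([img_new_resize, tweet_new_resize], dim=-1)
        else:                           # code: gate from img + tweet
            gate_in = img_new_resize + tweet_new_resize
        z = torch.sigmoid(self.gate(gate_in))
        return z * h_img + (1 - z) * h_txt

fused = GatedMultimodalUnit(dim=256, gate_from="sum")(torch.randn(8, 256), torch.randn(8, 256))

The two variants differ only in the gate's input; the sum version halves the gate's parameter count, which may be why the code diverges from the paper.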
-
Hello, I would like to run some experiments with the ALBEF model. For this I reviewed your paper as well, but I am unable to understand why the first six layers of BERT-base were used as the text encoder and why la…
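For context, ALBEF reuses a single BERT-base: the first six transformer layers act as the unimodal text encoder, and the last six (extended with cross-attention to image features in ALBEF) act as the multimodal fusion encoder, so the split adds no parameters over BERT-base. A minimal sketch of that split, assuming the HuggingFace bert-base-uncased checkpoint; the cross-attention insertion done in the real repo is omitted here:

from transformers import BertModel

bert = BertModel.from_pretrained("bert-base-uncased")
text_encoder_layers = bert.encoder.layer[:6]    # layers 0-5: text-only encoding
fusion_encoder_layers = bert.encoder.layer[6:]  # layers 6-11: multimodal fusion
print(len(text_encoder_layers), len(fusion_encoder_layers))  # 6 6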
-
When I fuse RGB and audio, I get the 78.64% AP from your paper. But when I use three modalities, the AP is worse than in your paper. In principle, fusing more modalities should perform better, but in fact it does not. I am …
-
Hi, I am about to submit my paper on semantic segmentation, and I am wondering which subject area I should choose. Could you please share your choice of SUBJECT AREAS with me?
Subject Areas:
Deep …
-
Hi,
We used this config to train the AVE task on a 3090, with the processed data you provided, but the accuracy we got is 73.31:
python3 /code/AVE/main_trans.py --Adapter_downsample=8 --batch_siz…
-
Are there any ways to bypass the data-preprocessing step for MBT ("Attention Bottlenecks for Multimodal Fusion") if I only want to do inference without passing in the actual data from AS? I notice the m…
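If the goal is just a smoke test of the forward pass, one common workaround is to skip the AudioSet pipeline entirely and feed random tensors shaped like the model's inputs. A hedged sketch; the shapes below and the checkpoint loader are assumptions to be replaced with whatever the MBT implementation you use actually expects:

import torch

rgb = torch.randn(1, 3, 32, 224, 224)  # (batch, channels, frames, height, width) -- assumed
spec = torch.randn(1, 1, 800, 128)     # (batch, 1, time, mel bins) log-mel clip -- assumed
# model = load_mbt_checkpoint("...")   # hypothetical loader; use the repo's own
# with torch.no_grad():
#     logits = model(rgb, spec)        # AudioSet has 527 classes, so expect (1, 527)

This only verifies shapes and runtime, of course; any accuracy numbers still require the real preprocessed data.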
-
Hello,
I'm working on reproducing the results in your paper "Attention Bottlenecks for Multimodal Fusion" and trying to implement MBT for other audiovisual video classification tasks.
However, the pr…
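In case it helps with the reimplementation, the core of MBT is a small set of shared bottleneck tokens that each modality's transformer layer attends over, with the updated bottlenecks averaged across modalities. A minimal PyTorch sketch using stock encoder layers rather than the paper's ViT blocks; the hidden size and B=4 bottleneck tokens follow the paper, the rest is illustrative:

import torch
import torch.nn as nn

class BottleneckFusionLayer(nn.Module):
    # One fusion layer: each modality attends over its own tokens plus
    # the shared bottlenecks; the two updated bottleneck sets are averaged.
    def __init__(self, dim=768, heads=8):
        super().__init__()
        self.audio_blk = nn.TransformerEncoderLayer(dim, heads, batch_first=True)
        self.video_blk = nn.TransformerEncoderLayer(dim, heads, batch_first=True)

    def forward(self, audio, video, bottleneck):
        na, nv = audio.size(1), video.size(1)
        a = self.audio_blk(torch.cat([audio, bottleneck], dim=1))
        v = self.video_blk(torch.cat([video, bottleneck], dim=1))
        bottleneck = 0.5 * (a[:, na:] + v[:, nv:])  # average the bottleneck updates
        return a[:, :na], v[:, :nv], bottleneck

layer = BottleneckFusionLayer()
audio, video = torch.randn(2, 196, 768), torch.randn(2, 196, 768)
btl = torch.randn(2, 4, 768)  # B = 4 bottleneck tokens, as in the paper
audio, video, btl = layer(audio, video, btl)

Because the modalities exchange information only through those few tokens, cross-modal bandwidth is deliberately limited, which is what the paper credits for the gains.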
-
Hello, thank you for your excellent work. Could you please provide the reproducible weight files for the FMB dataset? Many thanks!
-
Hello, I'm trying to apply the OGM-GE strategy to a multimodal fusion network with text, video, and audio modalities (e.g. MISA, MAG). However, when I use the SGD optimizer, the model training process moves on wi…
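One way to extend the two-modality OGM rule to three encoders is to compare each modality's confidence against the mean of the others and damp the gradients of whichever modality is running ahead. A sketch under that assumption (the three-way ratio is a generalization of ours, not the paper's formula; the GE term would additionally add zero-mean Gaussian noise to the damped gradients):

import math

def ogm_coefficients(scores, alpha=0.8):
    # scores: modality name -> scalar confidence, e.g. the softmax
    # probability of the true class from that modality's unimodal logits.
    coeffs = {}
    for m in scores:
        others = [v for k, v in scores.items() if k != m]
        rho = scores[m] / (sum(others) / len(others) + 1e-8)
        # Damp the dominant modality; leave the weaker ones untouched.
        coeffs[m] = 1.0 - math.tanh(alpha * (rho - 1.0)) if rho > 1.0 else 1.0
    return coeffs

coeffs = ogm_coefficients({"audio": 0.9, "video": 0.4, "text": 0.5})
# audio is dominant -> coeffs["audio"] < 1, others stay at 1.0.
# After loss.backward() and before optimizer.step(), scale each encoder:
# for p in audio_encoder.parameters():
#     if p.grad is not None:
#         p.grad *= coeffs["audio"]  # (+ Gaussian noise for the GE step)

With plain SGD this scaling acts directly on the update, whereas adaptive optimizers partly renormalize gradient magnitudes, which may explain optimizer-dependent behavior.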