junwenxiong / diff_sal

Official implementation of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction

about the structure of sal_unet #6

Open watsy1314 opened 1 month ago

watsy1314 commented 1 month ago

Hello, author! I have a question: in the source code, both visual.py and audio-visual.py call sal_unet in saliency_decoder, which doesn't match the structure proposed in the paper. What's going on here?

junwenxiong commented 1 month ago

Can you provide the details?