Open watsy1314 opened 1 month ago
Hello Author! I would like to ask, the source code both visual.py and audio-visual.py call sal_unet in saliency_decoder, which doesn't match the structure proposed in the paper, ah? What's going on here?
Can you provide the details?
Hello Author! I would like to ask, the source code both visual.py and audio-visual.py call sal_unet in saliency_decoder, which doesn't match the structure proposed in the paper, ah? What's going on here?