oddguan / Audio-Visual-Video-Caption

Pytorch implementation of audio-visual fusion video captioning model
25 stars 8 forks source link