andrewowens / multisensory

Code for the paper: Audio-Visual Scene Analysis with Self-Supervised Multisensory Features
http://andrewowens.com/multisensory/
Apache License 2.0
220 stars 60 forks source link

duration_mult flag #37

Open kzhang3256 opened 4 years ago

kzhang3256 commented 4 years ago

Could you provide any explanation on using --duration_mult for audio-visual source separation? While --duration_mult 4 works well, --duration_mult 10 seems to have a worse result. The program will report an error if I use --duration_mult 12. If I only use --duration 20, the separated audio is almost the same as the source.

My goal is to do audio-visual separation for a 28 sec video.

Thanks!