It is unclear how to achieve temporal alignment between video and audio stream

YUCHEN005 / MIR-GAN

Code for paper "MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition"

Other

16 stars 1 forks source link

Open LindgeW opened 4 months ago

LindgeW commented 4 months ago

please give more details