About the real-time AVSR model

mpc001 / auto_avsr

Auto-AVSR: Lip-Reading Sentences Project

Apache License 2.0

158 stars 40 forks source link

About the real-time AVSR model #19

Closed csf123123 closed 8 months ago

csf123123 commented 9 months ago

Hi , Thanks for sharing your work. May I ask when will the real-time audiovisual pretrained models be likely to be released? I download a realtime avsr model from https://download.pytorch.org/torchaudio/tutorial-assets/device_avsr_model.pt . Then I want to test the eval.py(https://github.com/pytorch/audio/tree/main/examples/avsr/eval.py ). But it was wrong when load the model

mpc001 commented 8 months ago

Hi @csf123123, we are not allowed to provide the corresponding pretrained model for avsr repository due to legal issues. The model you've linked is seperated from the repository. Please note that that model is compatible with torchaudio version 2.0.2.