facebookresearch / VisualVoice

Audio-Visual Speech Separation with Cross-Modal Consistency
Other
220 stars 35 forks source link

Questions about the testRealVideo.py #6

Closed RashadAlison closed 2 years ago

RashadAlison commented 3 years ago

RuntimeError: Given groups=1, weight of size [512, 1152, 3, 3], expected input[1, 1792, 2, 64] to have 1152 channels, but got 1792 channels instead