Markfryazino / wav2lip-hq

Extension of Wav2Lip repository for processing high-quality videos.
534 stars 236 forks source link

Input shapes to the VGG discriminator are wrong #19

Open shehrum opened 2 years ago

shehrum commented 2 years ago

When running the model on 384 * 384 size imaged, I get the following error from VGG discriminator in the ESRGAN

assert x.size(2) == 128 and x.size(3) == 128, (f'Input spatial size must be 128x128, ' AssertionError: Input spatial size must be 128x128, but received torch.Size([4, 3, 384, 384]).

Chesuour commented 2 years ago

me too.I have the same problem. Do you know how to solve it?

yo-WASSUP commented 2 years ago

same problem

AIMads commented 1 year ago

Same here