Open davidvfx07 opened 2 years ago
So I am using BasicSR to upscale the output of Wav2Lip to a more usable size. By training my own model every use gives me accurate teeth and eyes and really helps with the believability. Wav2Lip outputs 96x96, x4 is 384x384. I cannot seem to find a way to train ESRGAN with VGGStyleDiscriminator for 384x384.
Any help appreciated!
Hello, I trained real-esrgan in a 10 minute single person video, and the effect was that my mouth was kept closed and couldn't be opened. How can I solve this problem?
So I am using BasicSR to upscale the output of Wav2Lip to a more usable size. By training my own model every use gives me accurate teeth and eyes and really helps with the believability. Wav2Lip outputs 96x96, x4 is 384x384. I cannot seem to find a way to train ESRGAN with VGGStyleDiscriminator for 384x384.
Any help appreciated!