Closed johndpope closed 3 weeks ago
@kwentar / @jackailab / @Jie-zju / @robinchm / @flyingshan https://github.com/johndpope/MegaPortrait-hack/issues/14
code is SLOWLY training on my local 3090 gpu - 512x512 - i didn't test inference yet.
to run training with 256x256 - i had ripped out the avgpool - or maybe a cleaner way.... https://github.com/johndpope/MegaPortrait-hack/blob/main/model.py#L254
dont really want to burn out my gpu - but there's a hq torrent which we could use to train in the cloud.
UPDATE - i think it just blew up using too much vram. going to set the save interval to 100 (the paper uses 200,000)
contrastive loss is blowing up.