eladrich / pixel2style2pixel

Official Implementation for "Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation" (CVPR 2021) presenting the pixel2style2pixel (pSp) framework
https://eladrich.github.io/pixel2style2pixel/
MIT License

Question about encoding image #284

Closed Eric07110904 closed 2 years ago

Eric07110904 commented 2 years ago

Thanks for your awesome work! I have a question about GAN inversion. I used pSp to do GAN inversion in the anime domain (512×512, 300k images) with a pre-trained anime StyleGAN2 (512×512). After training for 100,000 iterations with batch_size=4, I observed two problems:

  1. The detailed structure of the anime face is missing (it seems my model didn't capture parts such as the mouth and winking eyes).

  2. The output is blurred.

Do you have any suggestions for solving these two problems? I am wondering whether some parameter is set incorrectly, whether I should train for more iterations, or whether I should add the w_norm loss. Thanks for your reply! (A sketch of the kind of training command I mean is below.)
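
For reference, a hedged sketch of what such a training run might look like. Flag names follow the repo's `scripts/train.py`; the dataset type, paths, and loss weights here are illustrative placeholders, not the exact values used above:

```bash
# Hedged sketch of a pSp training command for a 512x512 anime dataset.
# Flag names follow scripts/train.py; dataset type, paths, and loss
# weights are illustrative placeholders.
# --w_norm_lambda enables the w-norm loss mentioned above, which pulls
# the predicted latents toward the average latent code.
python scripts/train.py \
    --dataset_type=anime_encode \
    --exp_dir=experiments/anime_inversion \
    --batch_size=4 \
    --output_size=512 \
    --stylegan_weights=pretrained_models/anime_stylegan2.pt \
    --start_from_latent_avg \
    --lpips_lambda=0.8 \
    --l2_lambda=1 \
    --w_norm_lambda=0.005
```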

yuval-alaluf commented 2 years ago

Using the ID loss is a bit strange here, since it was trained on real face images and your anime dataset is out of domain for it. Other than that, it could simply be that pSp is unable to fully capture all the details here. You could try more advanced encoders such as ReStyle and HyperStyle, or optimization-based approaches like PTI.
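
As a hedged sketch of the first suggestion: the repo also provides a MoCo-based similarity loss intended for non-face domains, which can stand in for the facial ID loss. Values below are illustrative, and the flag semantics follow my reading of the README:

```bash
# Hedged sketch: swapping the facial ID loss for the MoCo-based
# similarity loss that the pSp repo provides for non-face domains.
# Per the README, id_lambda should be set to 0 when moco_lambda is used;
# dataset type, paths, and values are illustrative placeholders.
python scripts/train.py \
    --dataset_type=anime_encode \
    --exp_dir=experiments/anime_inversion_moco \
    --batch_size=4 \
    --output_size=512 \
    --stylegan_weights=pretrained_models/anime_stylegan2.pt \
    --id_lambda=0 \
    --moco_lambda=0.5
```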