yuval-alaluf / hyperstyle

Official Implementation for "HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing" (CVPR 2022) https://arxiv.org/abs/2111.15666
https://yuval-alaluf.github.io/hyperstyle/
MIT License
998 stars 115 forks source link

Training Process Problem #35

Closed FeiiYin closed 2 years ago

FeiiYin commented 2 years ago

During the training process, we first achieved a rough inversion result from e4e, which is a high-fidelity result. Then we use the hypernet to predict the weight deltas of stylegan convolution kernels. During the training process, will the results first be blurry and then gradually be clear? or the image results are kept in the stylegan domain with high fidelity? Since the convolution weight deltas are hard to learn.

Thanks in advance!

yuval-alaluf commented 2 years ago

Since we learn modifications of the StyleGAN weights, if everything works correctly, the original outputs should be similar to the outputs you get from e4e. Then, throughout training, the reconstructions should gradually improve. In other words, the initial results shouldn't be blurry assuming that the reconstructions of e4e are good (as you mentioned).

FeiiYin commented 2 years ago

Thanks!