Details of Diffusion Model Training based Paint By Example

bcmi / DCI-VTON-Virtual-Try-On

[ACM Multimedia 2023] Taming the Power of Diffusion Models for High-Quality Virtual Try-On with Appearance Flow.

MIT License

Hi @EricShow, thank you for your interest in our work. The facial deformation caused by the VAE does exist, but we did not pay much attention to it. At inference time, we chose to directly paste back the area outside the inpainting mask. There are also other ways to address this problem. The simplest is to increase the image resolution: at 1024 resolution, the loss introduced by the VAE basically does not affect image quality. You can also refer to Asymmetric_VQGAN and retrain the decoder separately to solve this problem.
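For readers who want to try the paste-back step, here is a minimal sketch of the idea, not the repository's actual code. The function name `paste_back`, the NumPy array layout, and the mask convention (1 inside the inpainted region, 0 outside) are all assumptions made for illustration.

```python
import numpy as np


def paste_back(original, generated, mask):
    """Keep generated pixels inside the mask and original pixels outside it.

    Assumed convention (hypothetical, not taken from the repo):
    mask == 1 marks the inpainted region, mask == 0 marks pixels
    that should be copied back from the original image.
    """
    original = np.asarray(original, dtype=np.float32)
    generated = np.asarray(generated, dtype=np.float32)
    mask = np.asarray(mask, dtype=np.float32)

    # Add a channel axis so an H x W mask broadcasts over H x W x C images.
    if mask.ndim == original.ndim - 1:
        mask = mask[..., None]

    return mask * generated + (1.0 - mask) * original


# Toy example: a 4 x 4 RGB image with a 2 x 2 inpainted square in the middle.
original = np.zeros((4, 4, 3), dtype=np.float32)
generated = np.ones((4, 4, 3), dtype=np.float32)
mask = np.zeros((4, 4), dtype=np.float32)
mask[1:3, 1:3] = 1.0

result = paste_back(original, generated, mask)
```

With this compositing, the face and other regions outside the mask never pass through the VAE, so they are not affected by its reconstruction loss. A soft (blurred) mask would work with the same formula if a smoother seam is wanted.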

Details of Diffusion Model Training based Paint By Example #5