yisol / IDM-VTON

IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
https://idm-vton.github.io/
3.21k stars 491 forks source link

Some questions about the number of channels of Unet and its' initialization #39

Open Delicious-Bitter-Melon opened 2 months ago

Delicious-Bitter-Melon commented 2 months ago

Thanks for your excellent work.

Do you expand all convolutional layers of UNet to 13 channels initialized with zero weights or only expand the first convolutional layer of UNet to 13 channels? Do you use the pre-trained SDXL inpainting models to initialize the denoiser inpainting Unet?

yisol commented 2 months ago

Hello, we initialized additional channels with zero weights and used sdxl inpainting model for initialization of TryOnNet.