tencent-ailab / IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Apache License 2.0
5.08k stars 331 forks source link

How is IPA-FaceID adapter initialized? #291

Open zechenghe opened 7 months ago

zechenghe commented 7 months ago

It seems the Face-ID adapter is initialized with random weights. Therefore, before training the model is NOT identical to the T2I model and could generate artifacts? I was wondering if you have tried initializing the vision control as zeros, such that it won't affect the T2I generation at the beginning (similar to zero-convs in controlnet)? Thanks!