Worse image quality when combined with IPAdapter

non-ELLA workflow result combined with IPAdapter: (image)

ELLA workflow result combined with IPAdapter: (image)

IPAdapter reference style image: china_texture (JPEG)

The ELLA workflow result is far less realistic than the non-ELLA workflow result when the reference style image mostly contains unrealistic textures. I guess this is partly due to the IPAdapter overfitting: the image prompt is so strong that it overrides the text prompt. But the non-ELLA workflow result is realistic enough, and both workflows use the same IPAdapter and the same hyperparameters. So I wonder if it's because ELLA's output condition embedding does not match the original SD UNet as well as CLIPTextEncoder's output condition embedding does.

Is there a simple trick to tackle this problem? A training-free solution would be better~ @JettHu
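To make my guess a bit more concrete, here is a rough diagnostic sketch, not something taken from the ELLA or IPAdapter code. It assumes the two condition embeddings (one from CLIPTextEncoder, one from ELLA) have already been dumped as nested lists of shape [tokens][channels]; the function names `token_norm_stats` and `norm_ratio` and the toy values are hypothetical. It only compares per-token L2 norms, so a large ratio would hint at a scale mismatch but would not prove it is the cause.

```python
import math

def token_norm_stats(embedding):
    """Return (mean, std) of the per-token L2 norms of a [tokens][channels] embedding."""
    norms = [math.sqrt(sum(x * x for x in token)) for token in embedding]
    mean = sum(norms) / len(norms)
    var = sum((n - mean) ** 2 for n in norms) / len(norms)
    return mean, math.sqrt(var)

def norm_ratio(cond_a, cond_b):
    """Ratio of mean token norms; values far from 1 suggest a scale mismatch."""
    mean_a, _ = token_norm_stats(cond_a)
    mean_b, _ = token_norm_stats(cond_b)
    return mean_a / mean_b

# Toy stand-ins only (hypothetical values, NOT real CLIP or ELLA outputs).
clip_like = [[3.0, 4.0], [6.0, 8.0]]
ella_like = [[0.3, 0.4], [0.6, 0.8]]

print("CLIP-like norm mean/std:", token_norm_stats(clip_like))
print("ELLA-like norm mean/std:", token_norm_stats(ella_like))
print("mean norm ratio:", norm_ratio(clip_like, ella_like))
```

If the ratio for the real embeddings turned out to be far from 1, that might explain why the same IPAdapter weight dominates more in the ELLA workflow, though I have not verified this.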

TencentQQGYLab / ComfyUI-ELLA

Worse image quality when combined with IPAdapter #50