justinpinkney / stable-diffusion

MIT License
1.46k stars 269 forks source link

text input for Image Mixer #66

Open betterze opened 1 year ago

betterze commented 1 year ago

Dear Justin,

Thank you for your work in stable diffusion; it benefits me a lot.

Could you elaborate on how you train the 'Image Mixer' model? Why can it accept text input if the model is fine-tuned with only a CLIP image encoder? Or is the model fine-tuned with both the CLIP image and text encoder?

I appreciate your help.

Best wishes,

Zongze

MichaelFan01 commented 1 year ago

+1