Jack000 / glid-3-xl-stable

stable diffusion training
MIT License

Does this work with CLIP guidance for stable diffusion? #8

Open eware-godaddy opened 1 year ago

eware-godaddy commented 1 year ago

Does this codebase mean we can use CLIP guidance for generation, instead of the classifier-free guidance used by the regular model?

Jack000 commented 1 year ago

Kind of. You can backprop the CLIP gradients through the VAE decoder to the latents, but it uses a lot of VRAM and doesn't work that well in my experience. Ideally there would be a latent CLIP trained on the LDM embeddings instead of pixels, so the decode step could be skipped entirely.
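
For anyone wondering what "backprop through the VAE" amounts to, here is a toy sketch in plain Python. It is not code from this repo. `DECODER` and `IMAGE_ENCODER` are tiny made-up linear maps standing in for the VAE decoder and the CLIP image encoder, and `clip_score`, `clip_grad` and `guidance_step` are hypothetical names. The score is the cosine similarity between the encoded decoded image and a text embedding, and its gradient is pushed back to the latent by hand with the chain rule. In a real setup an autograd framework does this step, and the decoder activations it has to keep around at pixel resolution are most likely where the VRAM goes.

```python
import math

# Hypothetical linear stand-ins; the real models are deep networks.
DECODER = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]        # "VAE decoder": latent (2) -> pixels (3)
IMAGE_ENCODER = [[1.0, 0.0, 1.0], [0.0, 1.0, -1.0]]   # "CLIP image encoder": pixels (3) -> embedding (2)


def matvec(m, v):
    """Compute m @ v for a matrix given as a list of rows."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def matvec_t(m, v):
    """Compute m^T @ v, the backward pass of a linear layer."""
    return [sum(m[i][j] * v[i] for i in range(len(m))) for j in range(len(m[0]))]


def norm(v):
    return math.sqrt(sum(x * x for x in v))


def clip_score(z, text_emb):
    """Cosine similarity between the encoded decoded image and the text embedding."""
    emb = matvec(IMAGE_ENCODER, matvec(DECODER, z))
    dot = sum(a * b for a, b in zip(emb, text_emb))
    return dot / (norm(emb) * norm(text_emb))


def clip_grad(z, text_emb):
    """Gradient of clip_score with respect to the latent z."""
    # Forward pass: latent -> pixels -> embedding.
    u = matvec(IMAGE_ENCODER, matvec(DECODER, z))
    nu, nt = norm(u), norm(text_emb)
    dot = sum(a * b for a, b in zip(u, text_emb))
    # Gradient of the cosine similarity with respect to the image embedding u.
    g_u = [t / (nu * nt) - dot * ui / (nu ** 3 * nt) for ui, t in zip(u, text_emb)]
    # Backward pass: through the image encoder, then through the decoder.
    g_pixels = matvec_t(IMAGE_ENCODER, g_u)
    return matvec_t(DECODER, g_pixels)


def guidance_step(z, text_emb, scale):
    """Nudge the latent in the direction that raises the CLIP score."""
    g = clip_grad(z, text_emb)
    return [zi + scale * gi for zi, gi in zip(z, g)]


if __name__ == "__main__":
    z, text = [1.0, 0.0], [0.0, 1.0]
    print("score before:", clip_score(z, text))
    print("score after: ", clip_score(guidance_step(z, text, 0.5), text))
```

A latent CLIP would make the `DECODER` hop unnecessary: the score would be computed on `z` directly, so neither the forward decode nor its backward pass would be needed during guidance.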