Empty Condition for training unmodal image generation

ziqihuangg / Collaborative-Diffusion

[CVPR 2023] Collaborative Diffusion

https://ziqihuangg.github.io/projects/collaborative-diffusion.html

Other

405 stars 31 forks source link

Empty Condition for training unmodal image generation #26

Closed Open-Debin closed 11 months ago

Open-Debin commented 1 year ago

I am very interested in your work, thank you for sharing the codes of Collaborative Diffusion. I have two questions, what is the empty condition when you train text/mask to image generation? If I understand correctly, for text-to-image generation, the empty condition is [""], as your code shows "uc = model.get_learned_conditioning(n_samples * [""])". However, it's the empty condition for training mask to image generation. Is it a zero mask? Thank you very much for your attention.

ziqihuangg commented 11 months ago

“if scale != 1.0:” is by default never true, so the “uc = model.get_learned_conditioning(n_samples * [""])” is never implemented. If you want to use classifier-free guidance, perhaps you can try all-zero tensor as the mask.