Open Exuan148 opened 5 months ago
Hi, in the part of training free image generation pipeline, you inject features of several reference images into the self-attention, I would like to ask that where is the image features from? Are they from VAE encoder? Thanks!
Hi, in the part of training free image generation pipeline, you inject features of several reference images into the self-attention, I would like to ask that where is the image features from? Are they from VAE encoder? Thanks!