Closed SophieZhou closed 6 days ago
And by the way, how to generate the text messages or text annotations for each image?
And by the way, how to generate the text messages or text annotations for each image?
In your paper, I have not noticed this information. How to describe the images, using "cars, color, street" all these information?
And by the way, how to generate the text messages or text annotations for each image? I see, using images to generate text information by CLIP. Is it right.
The work is very nice and very sensible. But in the paper, you did not tell, which VAE model did you use. Does VAE effect the results? Is any VAE model all right?