CompVis / stable-diffusion

A latent text-to-image diffusion model
https://ommer-lab.com/research/latent-diffusion-models/
Other
67.44k stars 10.07k forks source link

Can i use diffusion model for Image Captioning ? #184

Open MohamedAliRashad opened 2 years ago

MohamedAliRashad commented 2 years ago

I want to run the models forward and backward ... Entering text and generating images and entering images and getting a text describing them. is this possible?

yifeisu commented 1 year ago

Hi, I have seen some related work, and I have posted it below https://arxiv.org/abs/2211.11694 https://arxiv.org/abs/2210.04559