jianjieluo / SCD-Net

[CVPR23] A cascaded diffusion captioning model with a novel semantic-conditional diffusion process that upgrades conventional diffusion model with additional semantic prior.
https://arxiv.org/abs/2212.03099
Other
57 stars 5 forks source link

How is the training sentence pool obtained? #12

Open 1301358882 opened 1 month ago

1301358882 commented 1 month ago

Hello, I would like to ask the cross-modal model to retrieve semantically related sentences from the training sentence pool. How is the training sentence pool obtained? Thank you very much! 微信图片_20241004215626