Hi @drboog,

Congrats on your paper's acceptance to CVPR, and thanks for sharing your wonderful work!
I have some questions about the COCO evaluation:
1. What dataset is used for evaluation on COCO? Is it the COCO 2017 val set or the COCO 2014 val set? The COCO 2017 val set has only 5,000 images, yet your paper says you randomly sampled 30,000 images. How do you sample them?
2. Are the 30K samples obtained by randomly combining differently designed prompts with the captions?
3. Could you please share your code for zero-shot COCO image generation? Zero-shot COCO generation is a commonly used benchmark, yet no paper has released code for this evaluation, and many details remain unclear. For example, some papers use COCO 2017 (e.g., Stable Diffusion) while others use COCO 2014. Why is that?

Best,
Runpei
1. Similar to other works, the zero-shot results are evaluated on the COCO 2014 split;
2. Captions from the validation set are randomly sampled and used as the inputs to the model; no additional human-designed template/text/prompt is used during evaluation (see the sketch after this list);
3. I don't know why Stable Diffusion reports results on COCO 2017, but the LDM paper (which SD is based on) evaluates on COCO 2014, and almost all published papers report zero-shot evaluation on the 2014 split. COCO 2017 images are more often used in downstream evaluation (after fine-tuning, thus not zero-shot), along with Localized Narratives text.
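
For what it's worth, the protocol described above usually boils down to something like the sketch below. This is not the paper's released script: `pycocotools`, the `clean-fid` package, the file paths, and the `model.generate(caption)` call are all assumptions standing in for whatever code you actually use.

```python
# A minimal sketch of the zero-shot COCO FID-30K protocol described above --
# NOT the paper's released code. Assumptions: pycocotools for the annotations,
# the clean-fid package for FID, placeholder paths, and a hypothetical
# `model.generate(caption)` call standing in for the text-to-image sampler.
import os
import random

from cleanfid import fid
from pycocotools.coco import COCO

random.seed(0)  # fix the seed so the 30K-caption subset is reproducible

# Load all caption annotations from the COCO 2014 validation split.
coco = COCO("annotations/captions_val2014.json")
anns = coco.loadAnns(coco.getAnnIds())

# Randomly sample 30,000 raw captions; no template or extra prompt is added.
captions = [a["caption"] for a in random.sample(anns, 30_000)]

# Generate one image per caption with the model under evaluation.
os.makedirs("generated", exist_ok=True)
for i, caption in enumerate(captions):
    image = model.generate(caption)  # hypothetical sampling call (PIL image)
    image.save(f"generated/{i:05d}.png")

# FID-30K between the generated images and the real COCO val2014 images.
score = fid.compute_fid("generated", "coco_val2014_images")
print(f"Zero-shot FID-30K: {score:.2f}")
```

One practical note: the random seed (and hence the particular 30K caption subset) can shift FID slightly, so fixing it makes runs comparable.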