drboog / Shifted_Diffusion

Code for Shifted Diffusion for Text-to-image Generation (CVPR 2023)
Creative Commons Zero v1.0 Universal
159 stars 11 forks source link

Question about COCO zero-shot evaluation. #7

Closed RunpeiDong closed 1 year ago

RunpeiDong commented 1 year ago

Hi @drboog,

Contrasts on your paper's acceptance to CVPR, and thanks for sharing your wonderful work!

I have some questions about the COCO evaluation:

Best, Runpei

drboog commented 1 year ago
  1. Similar to other works, the zero-shot result is evaluated on COCO 2014 split;
  2. Captions of validation set will be randomly sampled, as the inputs for the model. No additional human designed template/text/prompt is used during evaluation;
  3. I don't know why Stable Diffusion reports results on COCO 2017, but the LDM paper (which SD is based on) evaluates results on COCO 2014. And I think almost all published paper report zero-shot evaluation on 2014 split. COCO 2017 images are more often used in downstream evaluation (after fine-tuning, thus not zero-shot), along with Local Narrative text;