KaiChen1998 / GeoDiffusion

Official PyTorch implementation of GeoDiffusion in ICLR 2024 (https://arxiv.org/abs/2306.04607)
https://kaichen1998.github.io/projects/geodiffusion/
MIT License
64 stars 3 forks source link

the nuimage 256x256 recurrence #16

Closed Sirius-Li closed 4 months ago

Sirius-Li commented 4 months ago

The pretrained nuimage 256x256 model pretrained model generates 14772x5 = 15485 images for testing, and then calculates the fid with the val set 14772 images, while the fid is 19.48. Moreover, I retrain the nuimage 256x256 model on stable diffusion which reach 15.90, also not reaching the accuracy of paper(14.58). I resize the images to 256x256 and use fid_score of pytorch_fid package to calculate the fid. Is this correct ?

KaiChen1998 commented 4 months ago

As claimed in our README file, we recommend users follow the exact setting of the LAMA repo step by step for precise evaluation on the COCO-Stuff Layout-to-Image benchmark, since the FID calculation is extremely sensitive to the implementation details.

Currently, I notice the following differences: