Closed ckczzj closed 6 months ago
@ckczzj could you try it again by re-downloading the model weights from HF and with the latest, original code? And also load model state dict with var.load_state_dict(var_ckpt, strict=True).
Thanks for replying. Today I re-clone the repo and re-download the ckpt (and load model with strict=True
) to calculate the FID for d16 model, and the result is still 18.25.
@ckczzj could you save the images into 1000 folders according to their labels (from 0 to 999)? and then visualize the label 980 (it'll have 50 images) to have a quick check.
I have solved the problem. Thanks for your replying.
Great works and thanks for publishing the code!
I encountered some problems when calculating the FID.
Originally I use my own FID calculation code to calculate the FID between the images from ImageNet validation dataset and their VQVAE autoencoding reconstruction, and the result is 0.92, which is make sense. However, when I use the same code to calculate the FID between the images from ImageNet validation dataset and your d16 conditional generated images, the result is 19.13.
I also use your method to calculate the FID: we use the following code to create a npz file and run
python evaluator.py VIRTUAL_imagenet256_labeled.npz tmp.npz
. The result is 18.25. Do your have any idea where I make mistakes.