FID scores not the same as reported in the paper (using the published E2H model weights)

Hello @alexzhou907 !

I used the official model weights posted here for the E2H dataset. I followed the code and sampled for 40 steps, but when I evaluate the FID and IS metrics, the results differ substantially from the numbers reported in the paper.

With 40 steps (so 119 NFEs), I get an FID of about 12 and an IS of about 4.1. The paper reports 1.83 FID and 3.73 IS.
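
For reference, the 119 NFE figure is consistent with three network evaluations per sampling step and one fewer on the final step. That per-step count is an assumption about the sampler, not something confirmed against the repository code, and `nfe_for_steps` is a hypothetical helper written only to show the arithmetic:

```python
def nfe_for_steps(steps: int, evals_per_step: int = 3) -> int:
    """Hypothetical helper: total network function evaluations (NFEs).

    Assumes `evals_per_step` network calls on every sampling step,
    with one call saved on the final step.
    """
    if steps < 1:
        raise ValueError("steps must be at least 1")
    return evals_per_step * steps - 1


if __name__ == "__main__":
    # 40 sampling steps, as used for the numbers above.
    print(nfe_for_steps(40))
```

If the sampler actually uses a different number of evaluations per step, the NFE count for 40 steps would differ, which may matter when comparing against the paper's setting.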

Are the published weights the ones used for the results in the paper? Could you please check, or help me figure out what is causing this discrepancy?

Best regards

alexzhou907 / DDBM, issue #12