Closed StarCycle closed 3 months ago
Thanks for your great suggestion. We evaluated the reconstruction performance of SD1.4 VAE on ImageNet and the rFID is 0.74. As you say, since they adopt much more training data, it's hard to make a fair comparison with them.
Hi,
Do you compare your ckpt with the vae/vqvae here?
If the direct comparison is not reasonable because their ckpts are trained with more data, did they have a version that was only trained with imagenet data but with the same config? Or can you evaluate their ckpt on imagenet even if these are trained on more data...I just want to know which ckpt is most suitable for my current application.
Also tencent released their Open-MAGVIT2
Best, StarCycle