CompVis / latent-diffusion

High-Resolution Image Synthesis with Latent Diffusion Models
MIT License
11.58k stars 1.51k forks source link

Evaluating first stage autoencoders #58

Open leod opened 2 years ago

leod commented 2 years ago

First of all, thank you so much for making this high-quality repository as well as pretrained models publicly available! This is highly useful for exploring your research.

I am currently training first stage autoencoders on a custom dataset (SoundCloud images) and am struggling with evaluating these models (other than with the loss values logged in TensorBoard). I plan to compare the performance of initializing the autoencoder weights randomly vs. fine-tuning one of your pretrained autoencoders.

I would prefer to calculate rFID, PSNR, and PSIM the same way as you did for your results table. Could you please provide a hint as to how you evaluate your autoencoders? Is there some other repository or toolkit that you rely on?

wtliao commented 1 year ago

@leod Hi, I want to do the same evaluation. Have you figured it out how to do that? Thanks!

Eurus-Holmes commented 7 months ago

@wtliao @leod @rromb FYI, I added some evaluation scripts for first stage autoencoders https://github.com/CompVis/latent-diffusion/pull/353/files

wtliao commented 7 months ago

@wtliao @leod @rromb FYI, I added some evaluation scripts for first stage autoencoders https://github.com/CompVis/latent-diffusion/pull/353/files

It is really appreciate! I will try and discuss with you further!