AssemblyAI-Community / MinImagen

MinImagen: A minimal implementation of the Imagen text-to-image model
https://assemblyai-examples.github.io/MinImagen/
MIT License
294 stars 54 forks source link

Inference results of the overfitted model are noise #17

Open kirilllzaitsev opened 1 year ago

kirilllzaitsev commented 1 year ago

Having an overfitted 'base' part of the Imagen, I try to sample with it for 300 timesteps and get the following: image During training, it predicts the noise well, and MSE goes down steadily. Somewhat similar is the following issue https://github.com/lucidrains/imagen-pytorch/issues/337.

Could you give any advice on the direction to investigate the issue?

chiho-5 commented 1 year ago

For how many epochs did you train. I think I'm encountering the same problem

sadath-12 commented 1 year ago

same problem

swayampragnya-malla commented 11 months ago

same problem

kirilllzaitsev commented 11 months ago

same problem

Try increasing the number of epochs (for instance, 1500 epochs (1 epoch = 1 gradient step) for a single image) and tuning the learning rate (starting with small values <1e-4)