tobran / GALIP

[CVPR2023] A faster, smaller, and better text-to-image model for large-scale training
MIT License
225 stars 25 forks source link

reproduce issue #9

Closed taki0112 closed 1 year ago

taki0112 commented 1 year ago

Hello. I trained your code on the coco dataset, but it doesn't seem to reproduce correctly. Should I set mixed_precision to True? The default value is False, but I'm wondering if it needs to be changed.

The following image is the resulting image obtained after training. It is significantly different from the coco image displayed on the main GitHub page.

image

tobran commented 1 year ago

Hi, the default value of mixed_precision is True in our code. You can print the args to check it before training. The current training results do not seem to be trained enough, what is the FID of the current result? The FID of a sufficiently trained GALIP could be around 5.5. If the current FID is relatively high, you can continue training for some time.