Open lcxsnow opened 1 year ago
In the paper, it is mentioned that the text_encoder will be fixed during the image synthesis training. However, in the provided code, the text_encoder is frozen, meaning it won’t be trained or optimized. Did I misunderstand something?
You can set it as trainable in the code. We did ablation studies both for freezing and training the text_encoder.
What do you mean by the text_encoder is frozen ? I thought you have to pre-train the DAMSM model, which place in the code said that ? requires_grad = False means to freeze the model. Yes, pre-train the DAMSM model, and freeze it when you are training GAN.
In the paper, it is mentioned that the text_encoder will be fixed during the image synthesis training. However, in the provided code, the text_encoder is frozen, meaning it won’t be trained or optimized. Did I misunderstand something?
You can set it as trainable in the code. We did ablation studies both for freezing and training the text_encoder.
Another question, I thought FID is more important than IS, because IS can't see the intra. Why do you pay more attention to IS in the paper.
In the paper, it is mentioned that the text_encoder will be fixed during the image synthesis training. However, in the provided code, the text_encoder is frozen, meaning it won’t be trained or optimized. Did I misunderstand something?