lucidrains / DALLE2-pytorch

Implementation of DALL-E 2, OpenAI's updated text-to-image synthesis neural network, in Pytorch
MIT License
11.17k stars 1.09k forks source link

prior train #236

Open yangzhip opened 2 years ago

yangzhip commented 2 years ago

RuntimeError: Sizes of tensors must match except in dimension 1. Expected size 768 but got size 512 for tensor number 3 in the list.

my clip feature is 512 how can i train this prior

lucidrains commented 2 years ago

@yangzhip you need to set image_embed_dim = 512 when instantiating DiffusionPrior

yangzhip commented 2 years ago

@yangzhip you need to set image_embed_dim = 512 when instantiating DiffusionPrior

Yes, the dimension I have changed in the config file is 512, but there is still an error. The character of the text in the code after the clip is 768. How to change it here? Thank you!

lucidrains commented 2 years ago

@yangzhip could you share your script?