Closed TousakaNagio closed 1 year ago
Hi, in the case of diffusers based code, we keep all embeddings trainable but manually set the gradient of all other embeddings except \<new1> token (denoted as V* in the paper) to 0 here. Thanks!
I got it. Thank you for your respond!
Another question. Would you please indicate the code which corresponding to here in train.py/custom_modules.py/model.py? Thanks!
Thank you for your detailed explanation. It has been very helpful for my research.
In the paper "Multi-Concept Customization of Text-to-Image Diffusion," it is stated that only the target token "new1" or V* was tuned. However, upon reviewing the code in diffuser_training.py, it appears that the entire embedding was optimized. Can you clarify if this is accurate? Thank you for your response.