Open Brembles opened 1 year ago
Thanks for your interest! I believe somewhere around 500-1000 epochs should do the job. If you are using the source code from our demo (diffusers version), then I would suggest 200 epochs. If you are interested in our demo, it is linked on our project website.
Thank you very much for your guidance! When I was training using your model, I only used Van Gogh as the prompt, and also set words as the prompt. The learning rate was set to 1e-5 and I only trained the xattn. I trained for 200 epochs at a time, and once for 500 epochs, but the results were not as good as the checkpoint you provided. How did you do it?
We trained it exactly the way you described. When you say "results are not as good" - how do you mean? Could you share some results?
What I mean is that my checkpoint doesn't completely remove the concept of "Van Gogh" under certain prompts. Could you share your training parameters with me?
Aah, so the ESD model is not loaded properly! clearly they look very close to each other. We used the default parameters from the training script we provided. I would look at the file name and it is being loaded properly. In the evaluation script before you do .load()
please check if the file it is trying to load exists. by doing os.path.exists(model_path)
.
The checkpoint I published is the one I trained. Your checkpoint performs very well, so I think my model loading is correct. I don't understand where I went wrong with my training parameters. Currently, I have set train-epoch=500 and lr=1×10^-5. Could you tell me your training parameters?
As I mentioned, we use the default parameters in the training script - LR 1e-5 and 1000 epochs
Thank you for your guidance, I will try again!
I retrained the model but the results are still not good. Maybe you could give me your email address, so I can contact you through email.
Your work is fantastic. If I want to train a model to specifically remove a concept similar to Van Gogh, how many epochs would be required ?