Closed: aurotripathy closed this issue 1 year ago
Duplicate of https://github.com/huggingface/diffusers/issues/1062
@patil-suraj, @Revist:
Thanks for the guidance.
If I use just the prior-preservation loss script as-is, I'm able to reproduce the result (or something close) for the prompt "A photo of sks dog in a bucket".
If I extend the fine-tuning to the text encoder, the output is the same as the input.
Is that expected?
I would love an answer here but I can close this and open a new one, if that's better.
Thank you again.
-Auro
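For readers following along, here is a minimal sketch of how a prior-preservation loss is typically combined with the instance loss in DreamBooth-style training: the loss on the instance images plus a weighted loss on the generated class images. Plain Python lists stand in for tensors, and the function names and the `prior_loss_weight` default are illustrative assumptions, not taken from the training script discussed here.

```python
def mse(pred, target):
    """Mean squared error between two equal-length sequences of floats."""
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")
    return sum((p - t) ** 2 for p, t in zip(pred, target)) / len(pred)


def dreambooth_loss(instance_pred, instance_target,
                    class_pred, class_target,
                    prior_loss_weight=1.0):
    """Instance loss plus weighted prior-preservation loss (toy version).

    The instance term fits the new concept (e.g. "sks dog"); the prior term
    keeps the model close to what it already generates for the class
    (e.g. "dog"). prior_loss_weight is a hypothetical default here.
    """
    instance_loss = mse(instance_pred, instance_target)
    prior_loss = mse(class_pred, class_target)
    return instance_loss + prior_loss_weight * prior_loss
```

With a larger `prior_loss_weight`, the class term pulls harder against overfitting to the few instance images, which is one knob to try when outputs look like copies of the inputs.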
I'm able to get decent results if I fine-tune the text encoder, even using low-quality images (all images look pretty similar).
I'm using 128 class images with 8 instance images, 800 steps. I had to try multiple times with different settings to get something like this:
a photo of sks man wearing an ironman suit
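As a rough illustration of the settings mentioned above (128 class images, 800 steps, text encoder training enabled), the sketch below assembles a command line for the diffusers `train_dreambooth.py` example script. Only those three settings come from this thread; the paths, the model name, and both prompts are placeholder assumptions, and other hyperparameters such as the learning rate are deliberately left out because they were not shared here.

```python
import shlex


def build_dreambooth_command(model_name, instance_dir, class_dir, output_dir,
                             *, train_text_encoder=True,
                             num_class_images=128, max_train_steps=800,
                             instance_prompt="a photo of sks man",
                             class_prompt="a photo of a man"):
    """Return the argument list for a DreamBooth run with prior preservation.

    The prompts are hypothetical placeholders; the class image count and
    step count default to the values quoted in the comment above.
    """
    args = [
        "accelerate", "launch", "train_dreambooth.py",
        "--pretrained_model_name_or_path", model_name,
        "--instance_data_dir", instance_dir,
        "--class_data_dir", class_dir,
        "--output_dir", output_dir,
        "--with_prior_preservation",
        "--instance_prompt", instance_prompt,
        "--class_prompt", class_prompt,
        "--num_class_images", str(num_class_images),
        "--max_train_steps", str(max_train_steps),
    ]
    if train_text_encoder:
        # Also fine-tune the text encoder, not just the UNet.
        args.append("--train_text_encoder")
    return args


if __name__ == "__main__":
    # All four positional values below are placeholders.
    cmd = build_dreambooth_command("path/to/base-model", "./instance_images",
                                   "./class_images", "./dreambooth-output")
    print(shlex.join(cmd))
```

Printing the command rather than running it makes it easy to compare two runs side by side when only one setting changes.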
@entrpn, thank you.
Would you be willing to share your settings for fine-tuning the text encoder (which ones)?
That would be helpful.
@aurotripathy as I said in the comment I linked, for DreamBooth we need to tune hyperparameters to get the best results. Also, training the text encoder is usually helpful for more complex concepts like faces. I don't think this is an issue with the training script. I would suggest trying different settings and picking what works best for your use case.
Thank you, @patil-suraj. Lots of good thoughts (for a newbie like me). Closing this.
Would you be willing to share your settings for fine-tuning the text encoder (which ones)? That would be helpful.
@aurotripathy maybe this helps: https://wandb.ai/psuraj/dreambooth/reports/Dreambooth-Training-Analysis--VmlldzoyNzk0NDc3
@aurotripathy take a look at this project: https://github.com/entrpn/serving-model-cards/tree/main/training-dreambooth. That's what I used for the results above.
Describe the bug
Disappointed that the results are not reproducible (others say the same as well). In my case, the output image is very similar to one of the input images.
My steps:
Get the seeding dog images as instructed.
Install bitsandbytes. It needs to point to where CUDA is installed.
Run the finetune script.
Run the inference script.
Reproduction
No response
Logs
System Info
--