JoePenna / Dreambooth-Stable-Diffusion

Implementation of Dreambooth (https://arxiv.org/abs/2208.12242) by way of Textual Inversion (https://arxiv.org/abs/2208.01618) for Stable Diffusion (https://arxiv.org/abs/2112.10752). Tweaks focused on training faces, objects, and styles.
MIT License
3.19k stars, 558 forks

Establish a baseline with a sample set of training images #186

toyssamurai closed this issue 9 months ago

toyssamurai commented 1 year ago

It would be very helpful for beginners like me if your guide included a sample set of training images and a sample configuration. If we could train with the same sample images and configuration, we could check whether our trained model produces results of the same quality as yours.

yushan777 commented 1 year ago

Understood. However, there is a privacy issue with using and sharing real people's faces, although Joe has allowed others to use his face for training in the past. Perhaps it can be done with an art style, or with a face that's in the public domain (but isn't a known celebrity). I'll try to add that when I have a bit of time. Even if you can reproduce what someone else achieved with their dataset, using your own will be a different experience. I've had to go back and curate my datasets many times over, training again and again until the model reproduced the subject's likeness well and was flexible enough to be styled and used in different contexts.