AUTOMATIC1111 / stable-diffusion-webui

Stable Diffusion web UI
GNU Affero General Public License v3.0
139.15k stars 26.41k forks source link

[Bug]: Weird class images generated from dreambooth #7752

Open AKRD44 opened 1 year ago

AKRD44 commented 1 year ago

Is there an existing issue for this?

What happened?

I wrote in "full body cat" under Concepts->Prompts->Class prompt and asked it to generate class images but they're coming out horribly. My model uses the checkpoint "v1-5-pruned-emaonly.ckpt".

What's weird is that when I just use text2img with the same checkpoint I get great results. But as soon as I have it create class images then I get these weird pixelated outputs. This didn't use to happen before. Not sure if it's something wrong with dreambooth or how I filled out the UI as it changed since I last used it. Thanks for the help!

screenshots of my configurations: concept_page settings_page

the monstrosities: full body cat-c527cf8b full body cat-b2e3e101 full body cat-b5c6cfb3 full body cat-b369e2aa

Steps to reproduce the problem

Go to dreambooth, Go to the concepts tab, Under prompts->class prompt write in "full body cat". Under image generation -> Class Images Per Instance Image write in 1. Under directories->Dataset Directory point to a folder with at least 1 picture

Then under the generate tap click generate class images and you should see the output

What should have happened?

A decent image of a cat should have been produced, just like it can with text2img

Commit where the problem happens

python: 3.10.0  •  torch: 1.13.1+cu117  •  xformers: 0.0.16rc425  •  gradio: 3.16.2  •  commit: 226d840e  •  checkpoint: e1441589a6

What platforms do you use to access the UI ?

Windows

What browsers do you use to access the UI ?

Google Chrome

Command Line Arguments

--xformers --medvram --opt-split-attention --autolaunch

List of extensions

depthmap2mask sd_dreambooth_extension stable-diffusion-webui-depthmap-script
stable-diffusion-webui-instruct-pix2pix stable-diffusion-webui-pixelization

Console logs

#######################################################################################################
Initializing Dreambooth
If submitting an issue on github, please provide the below text for debugging purposes:

Python revision: 3.10.0 (tags/v3.10.0:b494f59, Oct  4 2021, 19:00:18) [MSC v.1929 64 bit (AMD64)]
Dreambooth revision: b5b67040502bab07225cae02244fa9b77d94f160
SD-WebUI revision: 226d840e84c5f306350b0681945989b86760e616

Checking Dreambooth requirements...
[+] bitsandbytes version 0.35.0 installed.
[+] diffusers version 0.10.2 installed.
[+] transformers version 4.25.1 installed.
[+] xformers version 0.0.16rc425 installed.
[+] torch version 1.13.1+cu117 installed.
[+] torchvision version 0.14.1+cu117 installed.

#######################################################################################################

Installing imageio-ffmpeg requirement for depthmap script
Installing pyqt5 requirement for depthmap script

Launching Web UI with arguments: --xformers --medvram --opt-split-attention --autolaunch
SD-Webui API layer loaded
Loading weights [e1441589a6] from C:\Users\alexy\Desktop\ai_stuff\stable-diffusion-webui\models\Stable-diffusion\v1-5-pruned.ckpt
Creating model from config: C:\Users\alexy\Desktop\ai_stuff\stable-diffusion-webui\configs\v1-inference.yaml
LatentDiffusion: Running in eps-prediction mode
DiffusionWrapper has 859.52 M params.
Applying xformers cross attention optimization.
Textual inversion embeddings loaded(0):
Model loaded in 78.3s (load weights from disk: 68.4s, load config: 1.0s, create model: 2.9s, apply weights to model: 3.4s, apply half(): 1.9s, load VAE: 0.3s, load textual inversion embeddings: 0.3s).
Running on local URL:  http://127.0.0.1:7860

To create a public link, set `share=True` in `launch()`.
Generating class images...
Pre-processing fat_cats: 100%|████████████████████████████████████████████████████████| 35/35 [00:00<00:00, 699.98it/s]
Pre-processing classifiers_0: 100%|██████████████████████████████████████████████████████| 5/5 [00:00<00:00, 38.42it/s]
Concept requires 10 class images per instance image.
Sorting instance images: 100%|██████████████████████████████████████████████████████| 13/13 [00:00<00:00, 13053.85it/s]
We need a total of 345 class images.
Generating 345 class images for training...
Generating class images 0/345::   0%|                                                          | 0/345 [00:00<?, ?it/s]Loading model: v1-5-pruned-emaonly
Loading weights [cc6cb27103] from C:\Users\alexy\Desktop\ai_stuff\stable-diffusion-webui\models\Stable-diffusion\v1-5-pruned-emaonly.ckpt
Creating model from config: C:\Users\alexy\Desktop\ai_stuff\stable-diffusion-webui\configs\v1-inference.yaml
LatentDiffusion: Running in eps-prediction mode
DiffusionWrapper has 859.52 M params.
Applying xformers cross attention optimization.
Model loaded in 45.7s (load weights from disk: 41.6s, create model: 0.6s, apply weights to model: 0.9s, apply half(): 0.9s, load VAE: 1.5s).
100%|██████████████████████████████████████████████████████████████████████████████████| 20/20 [00:23<00:00,  1.16s/it]
100%|██████████████████████████████████████████████████████████████████████████████████| 20/20 [00:08<00:00,  2.40it/s]
Generating class images 2/345::   0%|▏                                                         | 1/345 [00:00<?, ?it/s]Generated 2 new class images.
Generating class images 2/345::   0%|▏                                                 | 1/345 [00:00<01:07,  5.13it/s]
Restored system models.

Additional information

No response

vladmandic commented 1 year ago

this should be reported in dreambooth repository (its a separate extension), not webui

78Alpha commented 1 year ago

https://github.com/d8ahazard/sd_dreambooth_extension/issues/883

Similar, comes from Xformers