Lora example does not seem to train anyting

kopyl commented 10 months ago

Describe the bug

I ran into this bug: https://github.com/huggingface/diffusers/issues/5897 so I used this for the training: https://github.com/huggingface/diffusers/blob/1477865e4838d887bb93750dc325e10f1e6ae534/examples/text_to_image/train_text_to_image_lora.py

Reproduction

Install:

!git clone https://github.com/huggingface/diffusers
%cd diffusers
!pip install .
%cd examples/text_to_image
!pip install -r requirements.txt

Train:

!accelerate launch --mixed_precision="fp16" train_text_to_image_lora.py \
  --pretrained_model_name_or_path=lambdalabs/miniSD-diffusers \
  --dataset_name=kopyl/logo-dataset-40 \
  --resolution=256 \
  --train_batch_size=1024 \
  --num_train_epochs=100 --checkpointing_steps=500 \
  --learning_rate=1e-04 --lr_scheduler="constant" --lr_warmup_steps=0 \
  --seed=42 \
  --output_dir="/workspace/lora-logo-test-2" \
  --noise_offset=0.05 \
  --cache_dir="/workspace/dataset-cache" \
  --validation_prompt="a black minimalistic abstract logo on white background" \
  --report_to="wandb"

Logs

No response

System Info

RTX 4090, Linux

Who can help?

No response

maliozer commented 10 months ago

try with more basic setup with removing --mixed_precision and use tensorboard rather than wandb

sayakpaul commented 9 months ago

Can you try this again? We recently performed some upgrades to the scripts.

kopyl commented 9 months ago

Can you try this again? We recently performed some upgrades to the scripts.

can not do that right now, will do later, thanks.

What did you change exactly in the script?

kopyl commented 9 months ago

try with more basic setup with removing --mixed_precision and use tensorboard rather than wandb

indeed removing --mixed_precision helped. But now i got twice as slow training :(

Is there any ways to still use --mixed_precision as shown in the official Diffusers guide?

Here is the error i'm getting: https://pastebin.com/1H1dMF5q

@maliozer

sayakpaul commented 9 months ago

Removing --mixed_precision shouldn't have to be necessary.

See Colab: https://colab.research.google.com/gist/sayakpaul/ce4eb75b1e5751f4d389a086ca2880a2/scratchpad.ipynb

github-actions[bot] commented 8 months ago

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

maliozer commented 7 months ago

I have to awake this issue again, I have tried the lora example with and without lora weights with same seed (42) with no change on generated images. It seems this image comes from only pretrained stable diffusion itself.

0_nolora_blue_pokemon.png

1_with_lora_blue_pokemon.png same image as above.

I shared a colab: https://colab.research.google.com/drive/1l_gDFGgRF5PDad3lDLHoF3SrEFqJ4jDq?usp=sharing @sayakpaul

sayakpaul commented 7 months ago

I don't have access to "/content/drive/MyDrive/research/temp/lora/pokemon/checkpoint-100" in your notebook.

maliozer commented 7 months ago

pipe.unet.load_attn_procs(f"checkpoint_path", adapter_name="pokemon")
pipe.unet.set_adapters(["pokemon"], weights=[1.0])
pipe.unet.fuse_lora()

has no effect but

pipe.unload_lora_weights()
pipe.load_lora_weights("checkpoint_path")

works properly. I can move with this method but not sure for further issues related to set_adapters, fuse_lora . for your info.

sayakpaul commented 7 months ago

pipe.unet.load_attn_procs(f"checkpoint_path", adapter_name="pokemon")

Instead of this, use load_lora_weights() on the pipe object?

maliozer commented 7 months ago

how to fuse multiple adapters without set_adapters ?

sayakpaul commented 7 months ago

Why would you not want to use that?

maliozer commented 7 months ago

Ok I got the process now :smile:

huggingface / diffusers