Open GreenTeaBD opened 1 year ago
Same thing with Ubuntu 22.10 (not WSL) Cuda 11.6 python=3.10 pytorch1.12.1 torchvision 0.13.1 torchaudio 0.12.1 cudatoolkit 11.6, triton, xformers
Running with;
export MODEL_NAME="CompVis/stable-diffusion-v1-4" export INSTANCE_DIR="training" export CLASS_DIR="classes" export OUTPUT_DIR="model"
accelerate launch train_dreambooth.py \ --pretrained_model_name_or_path=$MODEL_NAME \ --instance_data_dir=$INSTANCE_DIR \ --class_data_dir=$CLASS_DIR \ --output_dir=$OUTPUT_DIR \ --with_prior_preservation --prior_loss_weight=1.0 \ --instance_prompt="skscody" \ --class_prompt="a photo of person" \ --seed=1337 \ --resolution=512 \ --train_batch_size=1 \ --gradient_accumulation_steps=1 --gradient_checkpointing \ --learning_rate=5e-6 \ --lr_scheduler="constant" \ --lr_warmup_steps=0 \ --num_class_images=200 \ --sample_batch_size=1 \ --max_train_steps=1000 \ --mixed_precision=fp16
Edit: And in the colab, so I am either insane/breaking something that should be obvious or somethings broken
https://github.com/ShivamShrirao/diffusers/issues/159#issuecomment-1344621003
Check your images are being uploaded. I think they have to be size: 512x512
Describe the bug
Ubuntu WSL and Windows Fails to train in the same way in both, fails at Caching latents
Reproduction
No response
Logs
System Info
diffusers
version: 0.11.1