TheLastBen / fast-stable-diffusion

fast-stable-diffusion + DreamBooth
MIT License
7.51k stars 1.31k forks source link

Can't start training #2693

Open Barry-Bot1 opened 9 months ago

Barry-Bot1 commented 9 months ago

Training the UNet... WARNING[XFORMERS]: xFormers can't load C++/CUDA extensions. xFormers was built for: PyTorch 2.1.0+cu118 with CUDA 1106 (you have 2.1.0+cu121) Python 3.9.16 (you have 3.10.12) Please reinstall xformers (see https://github.com/facebookresearch/xformers#installing-xformers) Memory-efficient attention, SwiGLU, sparse and more won't be available. Set XFORMERS_MORE_DETAILS=1 for more details Traceback (most recent call last): File "/content/diffusers/examples/dreambooth/train_dreambooth.py", line 803, in main() File "/content/diffusers/examples/dreambooth/train_dreambooth.py", line 512, in main vae = AutoencoderKL.from_pretrained(args.pretrained_model_name_or_path, subfolder="vae") File "/usr/local/lib/python3.10/dist-packages/diffusers/models/modeling_utils.py", line 558, in from_pretrained raise ValueError( ValueError: Cannot load <class 'diffusers.models.autoencoder_kl.AutoencoderKL'> from /content/stable-diffusion-custom because the following keys are missing: decoder.mid_block.attentions.0.value.weight, encoder.mid_block.attentions.0.key.bias, decoder.mid_block.attentions.0.proj_attn.bias, encoder.mid_block.attentions.0.value.weight, decoder.mid_block.attentions.0.key.bias, encoder.mid_block.attentions.0.proj_attn.weight, encoder.mid_block.attentions.0.query.bias, decoder.mid_block.attentions.0.value.bias, decoder.mid_block.attentions.0.query.weight, encoder.mid_block.attentions.0.query.weight, decoder.mid_block.attentions.0.proj_attn.weight, decoder.mid_block.attentions.0.query.bias, encoder.mid_block.attentions.0.proj_attn.bias, decoder.mid_block.attentions.0.key.weight, encoder.mid_block.attentions.0.value.bias, encoder.mid_block.attentions.0.key.weight. Please make sure to pass low_cpu_mem_usage=False and device_map=None if you want to randomely initialize those weights or else make sure your checkpoint file is correct. Traceback (most recent call last): File "/usr/local/bin/accelerate", line 8, in sys.exit(main()) File "/usr/local/lib/python3.10/dist-packages/accelerate/commands/accelerate_cli.py", line 43, in main args.func(args) File "/usr/local/lib/python3.10/dist-packages/accelerate/commands/launch.py", line 837, in launch_command simple_launcher(args) File "/usr/local/lib/python3.10/dist-packages/accelerate/commands/launch.py", line 354, in simple_launcher raise subprocess.CalledProcessError(returncode=process.returncode, cmd=cmd) subprocess.CalledProcessError: Command '['/usr/bin/python3', '/content/diffusers/examples/dreambooth/train_dreambooth.py', '--image_captions_filename', '--train_only_unet', '--save_starting_step=500', '--save_n_steps=0', '--Session_dir=/content/gdrive/MyDrive/Fast-Dreambooth/Sessions/Alfie2', '--pretrained_model_name_or_path=/content/stable-diffusion-custom', '--instance_data_dir=/content/gdrive/MyDrive/Fast-Dreambooth/Sessions/Alfie2/instance_images', '--output_dir=/content/models/Alfie2', '--captions_dir=/content/gdrive/MyDrive/Fast-Dreambooth/Sessions/Alfie2/captions', '--instance_prompt=', '--seed=616357', '--resolution=512', '--mixed_precision=fp16', '--train_batch_size=1', '--gradient_accumulation_steps=1', '--use_8bit_adam', '--learning_rate=2e-06', '--lr_scheduler=linear', '--lr_warmup_steps=0', '--max_train_steps=1500']' returned non-zero exit status 1. Something went wrong

TheLastBen commented 9 months ago

fixed