[Issue]: Accelerate version 0.22.0 and above causes a bug in inpaint mode.

Issue Description

Inpaint mode fail when using Accelerate version 0.22.0 and above

21:45:15-408496 ERROR    Diffusers failed loading:                                                  
                         model=/stable-diffusion-data/models/stable-diffusion/i
                         npaintClothing_uberrealisticinpaint-inpainting.safetensors                 
                         pipeline=Autodetect/NoneType Trying to set a tensor of shape               
                         torch.Size([320, 9, 3, 3]) in "weight" (which has shape torch.Size([320, 4,
                         3, 3])), this look incorrect. 

21:55:10-778978 ERROR    Diffusers failed loading:                                                  
                         model=/stable-diffusion-data/models/stable-diffusion/a
                         ZovyaPhotoreal_v2InpaintVAE.safetensors pipeline=Autodetect/NoneType Trying
                         to set a tensor of shape torch.Size([320, 9, 3, 3]) in "weight" (which has 
                         shape torch.Size([320, 4, 3, 3])), this look incorrect.

Works good if rollback to: accelerate==0.21.0 Fail with: 0.22.0, 0.23.0, 0.24.0, 0.24.1

Version Platform Description

Linux 6.6.2-arch1-1 Intel Arc A380 AMD Ryzen 7 2700

Relevant log output

Setting OneAPI environment

:: initializing oneAPI environment ...
   webui.sh: BASH_VERSION = 5.2.21(1)-release
   args: Using "$@" for setvars.sh arguments: --data-dir /apps/sd.next --debug --use-ipex 
:: advisor -- latest
:: ccl -- latest
:: compiler -- latest
:: dal -- latest
:: debugger -- latest
:: dev-utilities -- latest
:: dnnl -- latest
:: dpcpp-ct -- latest
:: dpl -- latest
:: ipp -- latest
:: ippcp -- latest
:: ipp -- latest
:: mkl -- latest
:: mpi -- latest
:: tbb -- latest
:: vtune -- latest
:: oneAPI environment initialized ::

Launching ipexrun launch.py...
/apps/sd.next/venv/lib/python3.11/site-packages/torchvision/io/image.py:13: UserWarning: Failed to load image Python extension: ''If you don't plan on using image functionality from `torchvision.io`, you can ignore this warning. Otherwise, there might be something wrong with your environment. Did you have `libjpeg` or `libpng` installed before building `torchvision` from source?
  warn(
2023-11-25 09:20:38.420725: I tensorflow/tsl/cuda/cudart_stub.cc:28] Could not find cuda drivers on your machine, GPU will not be used.
2023-11-25 09:20:38.590728: I tensorflow/tsl/cuda/cudart_stub.cc:28] Could not find cuda drivers on your machine, GPU will not be used.
2023-11-25 09:20:38.591196: I tensorflow/core/platform/cpu_feature_guard.cc:182] This TensorFlow binary is optimized to use available CPU instructions in performance-critical operations.
To enable the following instructions: AVX2 FMA, in other operations, rebuild TensorFlow with the appropriate compiler flags.
2023-11-25 09:20:39.829275: W tensorflow/compiler/tf2tensorrt/utils/py_utils.cc:38] TF-TRT Warning: Could not find TensorRT
2023-11-25 09:20:42.824176: I itex/core/wrapper/itex_gpu_wrapper.cc:35] Intel Extension for Tensorflow* GPU backend is loaded.
2023-11-25 09:20:42.903986: W itex/core/ops/op_init.cc:58] Op: _QuantizedMaxPool3D is already registered in Tensorflow
2023-11-25 09:20:42.923395: I itex/core/devices/gpu/itex_gpu_runtime.cc:129] Selected platform: Intel(R) Level-Zero
2023-11-25 09:20:42.923740: I itex/core/devices/gpu/itex_gpu_runtime.cc:154] number of sub-devices is zero, expose root device.
2023-11-25 09:20:43,326 - numexpr.utils - INFO - Note: NumExpr detected 16 cores but "NUMEXPR_MAX_THREADS" not set, so enforcing safe limit of 8.
2023-11-25 09:20:43,326 - numexpr.utils - INFO - NumExpr defaulting to 8 threads.
/apps/sd.next/venv/lib/python3.11/site-packages/intel_extension_for_pytorch/launcher.py:102: UserWarning: Backend is not specified, it will automatically default to cpu.
  warnings.warn(
2023-11-25 09:20:43,816 - intel_extension_for_pytorch.cpu.launch.launch - INFO - Use 'jemalloc' memory allocator.
2023-11-25 09:20:43,817 - intel_extension_for_pytorch.cpu.launch.launch - WARNING - 'intel' OpenMP runtime is not found in ['/apps/sd.next/venv/lib/', '/apps/.local/lib/', '/usr/local/lib/', '/usr/local/lib64/', '/usr/lib/', '/usr/lib64/', '/usr/lib/x86_64-linux-gnu/'].
2023-11-25 09:20:43,818 - intel_extension_for_pytorch.cpu.launch.launch - INFO - Use 'default' OpenMP runtime.
2023-11-25 09:20:43,823 - intel_extension_for_pytorch.cpu.launch.launch - INFO - Use 'taskset' multi-task manager.
2023-11-25 09:20:43,823 - intel_extension_for_pytorch.cpu.launch.launch - INFO - env: Untouched preset environment variables are not displayed.
2023-11-25 09:20:43,823 - intel_extension_for_pytorch.cpu.launch.launch - INFO - env: LD_PRELOAD=/usr/lib/libjemalloc.so
2023-11-25 09:20:43,823 - intel_extension_for_pytorch.cpu.launch.launch - INFO - env: MALLOC_CONF=oversize_threshold:1,background_thread:true,metadata_thp:auto
2023-11-25 09:20:43,823 - intel_extension_for_pytorch.cpu.launch.launch - INFO - env: OMP_SCHEDULE=STATIC
2023-11-25 09:20:43,823 - intel_extension_for_pytorch.cpu.launch.launch - INFO - env: OMP_PROC_BIND=CLOSE
2023-11-25 09:20:43,823 - intel_extension_for_pytorch.cpu.launch.launch - INFO - env: OMP_NUM_THREADS=8
2023-11-25 09:20:43,823 - intel_extension_for_pytorch.cpu.launch.launch - INFO - cmd: taskset -c 0-7 /apps/sd.next/venv/bin/python -u launch.py --data-dir /apps/sd.next --debug --use-ipex 
09:20:44-320907 DEBUG    Logger: file=/apps/sd.next/sdnext.log       
                         level=10 size=0 mode=create                                                
09:20:44-324349 INFO     Starting SD.Next                                                           
09:20:44-325393 INFO     Python 3.11.6 on Linux                                                     
09:20:44-378649 INFO     Version: app=sd.next updated=2023-11-24 hash=a4a7a937                      
                         url=https://github.com/vladmandic/automatic/tree/master                    
09:20:45-268635 INFO     Platform: arch=x86_64 cpu= system=Linux release=6.6.2-arch1-1 python=3.11.6
09:20:45-271393 DEBUG    Setting environment tuning                                                 
09:20:45-273421 DEBUG    Cache folder: /apps/.cache/huggingface/hub  
09:20:45-275289 DEBUG    Torch overrides: cuda=False rocm=False ipex=True diml=False openvino=False 
09:20:45-277496 DEBUG    Torch allowed: cuda=False rocm=False ipex=True diml=False openvino=False   
09:20:45-280565 INFO     Intel OneAPI Toolkit detected                                              
09:20:45-283398 DEBUG    Package not found: onnxruntime-openvino                                    
09:20:45-285539 INFO     Installing package: onnxruntime-openvino                                   
09:20:45-287342 DEBUG    Running pip: install --upgrade onnxruntime-openvino                        
09:20:47-161053 DEBUG    Repository update time: Fri Nov 24 16:44:14 2023                           
09:20:47-163422 INFO     Startup: standard                                                          
09:20:47-164919 INFO     Verifying requirements                                                     
09:20:47-194984 WARNING  Package version mismatch: accelerate 0.21.0 required 0.24.1                
09:20:47-197707 INFO     Installing package: accelerate==0.24.1                                     
09:20:47-199518 DEBUG    Running pip: install --upgrade accelerate==0.24.1                          
09:20:50-683744 INFO     Verifying packages                                                         
09:20:50-686383 INFO     Verifying submodules                                                       
09:20:51-584283 DEBUG    Submodule: extensions-builtin/sd-extension-chainner / main                 
09:20:51-620198 DEBUG    Submodule: extensions-builtin/sd-extension-system-info / main              
09:20:51-659207 DEBUG    Submodule: extensions-builtin/sd-webui-agent-scheduler / main              
09:20:51-686062 DEBUG    Submodule: extensions-builtin/sd-webui-controlnet / main                   
09:20:51-738834 DEBUG    Submodule: extensions-builtin/stable-diffusion-webui-images-browser / main 
09:20:51-777749 DEBUG    Submodule: extensions-builtin/stable-diffusion-webui-rembg / master        
09:20:51-803998 DEBUG    Submodule: modules/k-diffusion / master                                    
09:20:51-833441 DEBUG    Submodule: modules/lora / main                                             
09:20:51-874046 DEBUG    Submodule: wiki / master                                                   
09:20:51-897701 DEBUG    Register paths                                                             
09:20:52-004481 DEBUG    Installed packages: 239                                                    
09:20:52-005438 DEBUG    Extensions all: ['Lora', 'sd-extension-chainner',                          
                         'sd-extension-system-info', 'sd-webui-agent-scheduler',                    
                         'stable-diffusion-webui-images-browser', 'stable-diffusion-webui-rembg']   
09:20:52-111643 DEBUG    Running extension installer:                                               
                         /apps/sd.next/extensions-builtin/sd-extensio
                         n-system-info/install.py                                                   
09:20:52-382004 DEBUG    Running extension installer:                                               
                         /apps/sd.next/extensions-builtin/sd-webui-ag
                         ent-scheduler/install.py                                                   
09:20:52-606912 DEBUG    Running extension installer:                                               
                         /apps/sd.next/extensions-builtin/stable-diff
                         usion-webui-images-browser/install.py                                      
09:20:52-853594 DEBUG    Running extension installer:                                               
                         /apps/sd.next/extensions-builtin/stable-diff
                         usion-webui-rembg/install.py                                               
09:20:53-151891 DEBUG    Extensions all: ['sd-webui-aspect-ratio-helper']                           
09:20:53-213062 INFO     Extensions enabled: ['Lora', 'sd-extension-chainner',                      
                         'sd-extension-system-info', 'sd-webui-agent-scheduler',                    
                         'stable-diffusion-webui-images-browser', 'stable-diffusion-webui-rembg',   
                         'sd-webui-aspect-ratio-helper']                                            
09:20:53-214346 INFO     Verifying requirements                                                     
09:20:53-247842 DEBUG    Setup complete without errors: 1700914853                                  
09:20:53-252480 INFO     Extension preload: {'extensions-builtin': 0.0,                             
                         '/apps/sd.next/extensions': 0.0}            
09:20:53-253639 DEBUG    Starting module: <module 'webui' from                                      
                         '/apps/sd.next/webui.py'>                   
09:20:53-254663 INFO     Command line args: ['--data-dir',                                          
                         '/apps/sd.next', '--debug', '--use-ipex']   
                         data_dir=/apps/sd.next debug=True           
                         use_ipex=True                                                              
/apps/sd.next/venv/lib/python3.11/site-packages/torchvision/io/image.py:13: UserWarning: Failed to load image Python extension: ''If you don't plan on using image functionality from `torchvision.io`, you can ignore this warning. Otherwise, there might be something wrong with your environment. Did you have `libjpeg` or `libpng` installed before building `torchvision` from source?
  warn(
09:20:56-245312 DEBUG    Load IPEX==2.0.110+xpu                                                     
09:20:58-104814 INFO     Load packages: torch=2.0.1a0+cxx11.abi diffusers=0.23.1 gradio=3.43.2      
09:20:58-804397 DEBUG    Read: file="/apps/sd.next/config.json"      
                         json=38 bytes=2101                                                         
09:20:58-806602 INFO     Engine: backend=Backend.DIFFUSERS compute=ipex mode=no_grad device=xpu     
                         cross-optimization="Scaled-Dot-Product"                                    
09:20:58-808376 INFO     Device: device=Intel(R) Arc(TM) A380 Graphics n=1 ipex=2.0.110+xpu         
2023-11-25 09:21:01.712141: I itex/core/wrapper/itex_gpu_wrapper.cc:35] Intel Extension for Tensorflow* GPU backend is loaded.
2023-11-25 09:21:01.754231: W itex/core/ops/op_init.cc:58] Op: _QuantizedMaxPool3D is already registered in Tensorflow
2023-11-25 09:21:01.771236: I itex/core/devices/gpu/itex_gpu_runtime.cc:129] Selected platform: Intel(R) Level-Zero
2023-11-25 09:21:01.771487: I itex/core/devices/gpu/itex_gpu_runtime.cc:154] number of sub-devices is zero, expose root device.
09:21:02-475383 DEBUG    Entering start sequence                                                    
09:21:02-476439 INFO     Using data path: /apps/sd.next              
09:21:02-486159 DEBUG    Initializing                                                               
09:21:02-489285 INFO     Available VAEs:                                                            
                         path="/apps/.local/share/stable-diffusion-data/models/vae" items=0      
09:21:02-490385 INFO     Disabling uncompatible extensions: backend=Backend.DIFFUSERS               
                         ['multidiffusion-upscaler-for-automatic1111', 'a1111-sd-webui-lycoris']    
09:21:02-492451 DEBUG    Scanning diffusers cache:                                                  
                         /apps/.local/share/stable-diffusion-data/models/Diffusers               
                         /apps/.local/share/stable-diffusion-data/models/Diffusers items=1       
                         time=0.00                                                                  
09:21:02-494898 DEBUG    Read: file="/apps/sd.next/cache.json" json=2
                         bytes=49127                                                                
09:21:02-503657 DEBUG    Read: file="/apps/sd.next/metadata.json"    
                         json=252 bytes=870327                                                      
09:21:02-519797 INFO     Available models:                                                          
                         path="/apps/.local/share/stable-diffusion-data/models/stable-diffusion" 
                         items=44 time=0.03                                                         
09:21:02-785759 DEBUG    Load extensions                                                            
09:21:04-439179 INFO     Extension:                                                                 
                         script='extensions-builtin/sd-webui-agent-scheduler/scripts/task_scheduler.
                         py' Using sqlite file:                                                     
                         extensions-builtin/sd-webui-agent-scheduler/task_scheduler.sqlite3         
09:21:05-355978 INFO     Extensions time: 2.57 { sd-webui-aspect-ratio-helper=0.52 Lora=0.30        
                         sd-extension-chainner=0.08 sd-webui-agent-scheduler=0.70                   
                         stable-diffusion-webui-images-browser=0.12                                 
                         stable-diffusion-webui-rembg=0.80 }                                        
09:21:05-503967 DEBUG    Read: file="html/upscalers.json" json=4 bytes=2640                         
09:21:05-506209 DEBUG    Read: file="extensions-builtin/sd-extension-chainner/models.json" json=24  
                         bytes=2693                                                                 
09:21:05-507464 DEBUG    chaiNNer models:                                                           
                         path="/apps/.local/share/stable-diffusion-data/models/chaiNNer"         
                         defined=24 discovered=0 downloaded=0                                       
09:21:05-509850 DEBUG    Load upscalers: total=52 downloaded=1 user=0 ['None', 'Lanczos', 'Nearest',
                         'ChaiNNer', 'ESRGAN', 'LDSR', 'RealESRGAN', 'SCUNet', 'SD', 'SwinIR']      
09:21:05-518718 DEBUG    Load styles:                                                               
                         folder="/apps/sd.next/models/styles"        
                         items=288                                                                  
09:21:05-523022 DEBUG    Creating UI                                                                
09:21:06-129870 INFO     Load UI theme: name="gradio/base" style=Light base=base.css                
09:21:06-131268 WARNING  Using Gradio default theme which is not optimized for SD.Next              
09:21:06-196067 DEBUG    Read: file="html/reference.json" json=13 bytes=7896                        
09:21:06-399195 DEBUG    Extra networks: page='model' items=57 subfolders=3 tab=txt2img             
                         folders=['/apps/.local/share/stable-diffusion-data/models/stable-diffusi
                         on', '/apps/.local/share/stable-diffusion-data/models/Diffusers',       
                         'models/Reference',                                                        
                         '/apps/.local/share/stable-diffusion-data/models/Stable-diffusion']     
                         list=0.02 desc=0.01 info=0.03                                              
09:21:06-408660 DEBUG    Extra networks: page='style' items=288 subfolders=2 tab=txt2img            
                         folders=['/apps/sd.next/models/styles',     
                         'html'] list=0.01 desc=0.00 info=0.00                                      
09:21:06-412067 DEBUG    Extra networks: page='embedding' items=35 subfolders=1 tab=txt2img         
                         folders=['/apps/.local/share/stable-diffusion-data/models/embeddings']  
                         list=0.03 desc=0.01 info=0.13                                              
09:21:06-414648 DEBUG    Extra networks: page='hypernetwork' items=0 subfolders=1 tab=txt2img       
                         folders=['/apps/.local/share/stable-diffusion-data/models/hypernetworks'
                         ] list=0.00 desc=0.00 info=0.00                                            
09:21:06-417072 DEBUG    Extra networks: page='vae' items=0 subfolders=1 tab=txt2img                
                         folders=['/apps/.local/share/stable-diffusion-data/models/vae']         
                         list=0.00 desc=0.00 info=0.00                                              
09:21:06-425798 DEBUG    Extra networks: page='lora' items=209 subfolders=1 tab=txt2img             
                         folders=['/apps/.local/share/stable-diffusion-data/models/lora',        
                         '/apps/.local/share/stable-diffusion-data/models/LyCORIS'] list=0.16    
                         desc=0.06 info=0.83                                                        
09:21:06-625330 DEBUG    Read: file="/apps/sd.next/ui-config.json"   
                         json=0 bytes=2                                                             
09:21:06-809139 DEBUG    Themes: builtin=6 default=5 external=55                                    
09:21:07-400902 DEBUG    Script: 0.52 ui_tabs                                                       
                         /apps/sd.next/extensions-builtin/stable-diff
                         usion-webui-images-browser/scripts/image_browser.py                        
09:21:07-554275 DEBUG    Extension list: processed=10 installed=10 enabled=7 disabled=3 visible=10  
                         hidden=0                                                                   
09:21:07-931734 INFO     Local URL: http://127.0.0.1:7860/                                          
09:21:07-932688 DEBUG    Gradio functions: registered=1611                                          
09:21:07-933380 INFO     Initializing middleware                                                    
09:21:07-950538 DEBUG    Creating API                                                               
09:21:08-046642 DEBUG    SD-System-Info: benchmark data loaded:                                     
                         /apps/sd.next/extensions-builtin/sd-extensio
                         n-system-info/scripts/benchmark-data-local.json                            
09:21:08-376418 INFO     [AgentScheduler] Task queue is empty                                       
09:21:08-378829 INFO     [AgentScheduler] Registering APIs                                          
09:21:08-416376 DEBUG    Script: 0.32 app_started                                                   
                         /apps/sd.next/extensions-builtin/sd-webui-ag
                         ent-scheduler/scripts/task_scheduler.py                                    
09:21:08-528451 DEBUG    Scripts setup: ['X/Y/Z Grid:0.006']                                        
09:21:08-529444 DEBUG    Model metadata:                                                            
                         file="/apps/sd.next/metadata.json" no       
                         changes                                                                    
09:21:08-530231 DEBUG    Model auto load disabled                                                   
09:21:08-537183 DEBUG    Save: file="/apps/sd.next/config.json"      
                         json=38 bytes=2101                                                         
09:21:08-538114 INFO     Startup time: 15.27 { torch=3.98 gradio=0.84 libraries=4.37 extensions=2.57
                         face-restore=0.26 upscalers=0.15 ui-extra-networks=0.91 ui-img2img=0.07    
                         ui-settings=0.25 ui-extensions=0.71 ui-defaults=0.06 launch=0.32 api=0.09  
                         app-started=0.49 }                                                         
09:21:16-470029 INFO     MOTD: N/A                                                                  
09:21:19-191346 DEBUG    Themes: builtin=6 default=5 external=55                                    
09:21:19-417052 INFO     Browser session: user=None client=127.0.0.1 agent=Mozilla/5.0 (X11; Linux  
                         x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/118.0.0.0            
                         Safari/537.36                                                              
09:21:26-514781 INFO     Select: model="inpaintClothing_uberrealisticinpaint-inpainting             
                         [53beed7e09]"                                                              
09:21:26-516778 DEBUG    Load model weights: existing=False                                         
                         target=/apps/.local/share/stable-diffusion-data/models/stable-diffusion/
                         inpaintClothing_uberrealisticinpaint-inpainting.safetensors info=None      
Loading model: /apps/.local/share/stable-diffusion-data/models/stable-diffusion/inpaintClothing… 
09:21:30-754365 DEBUG    Desired Torch parameters: dtype=BF16 no-half=False no-half-vae=False       
                         upscast=False                                                              
09:21:30-755952 INFO     Setting Torch parameters: device=xpu dtype=torch.bfloat16                  
                         vae=torch.bfloat16 unet=torch.bfloat16 context=no_grad fp16=False bf16=True
09:21:30-757086 INFO     Autodetect: model="Stable Diffusion" class=StableDiffusionPipeline         
                         file="/apps/.local/share/stable-diffusion-data/models/stable-diffusion/i
                         npaintClothing_uberrealisticinpaint-inpainting.safetensors" size=4068MB    
09:21:30-989089 ERROR    Diffusers failed loading:                                                  
                         model=/apps/.local/share/stable-diffusion-data/models/stable-diffusion/i
                         npaintClothing_uberrealisticinpaint-inpainting.safetensors                 
                         pipeline=Autodetect/NoneType Trying to set a tensor of shape               
                         torch.Size([320, 9, 3, 3]) in "weight" (which has shape torch.Size([320, 4,
                         3, 3])), this look incorrect.

Backend

Diffusers

Branch

Master

Model

SD 1.5

Acknowledgements

[X] I have read the above and searched for existing issues
[X] I confirm that this is classified correctly and its not an extension issue

vladmandic / automatic

[Issue]: Accelerate version 0.22.0 and above causes a bug in inpaint mode. #2559