vladmandic / automatic

SD.Next: Advanced Implementation of Stable Diffusion and other Diffusion-based generative image models
https://github.com/vladmandic/automatic
GNU Affero General Public License v3.0

[Issue]: Current Diffusers fromFile doesn't seem to support LatentConsistencyModelPipeline #2566

Closed by midcoastal 9 months ago

midcoastal commented 9 months ago

Issue Description

I noticed that one of my LCM models was not loading, so I went in and adjusted the matching criteria for LCM to properly detect the model name, and when I loaded it, I got the following error:

11:00:58-386201 ERROR    Diffusers failed loading: model=/[...]/modelname_lcm.safetensors pipeline=Autodetect/NoneType Unhandled pipeline class:        
                         LatentConsistencyModelPipeline

I'm not sure what I did wrong. Others here seem to have gotten LCM working correctly, but this looks like a bug? This is on current dev, launched with --reinstall.
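The error above can be sketched as a dispatch problem (an illustrative sketch only, not SD.Next's actual code): the name check guesses LatentConsistencyModelPipeline, but diffusers 0.23.x only implements single-file loading for some pipeline classes, so the autodetected class has no loader to hand off to:

```python
# Hypothetical subset of pipeline classes known to support single-file
# (.safetensors) loading; the real set lives inside diffusers.
SINGLE_FILE_CLASSES = {
    "StableDiffusionPipeline",
    "StableDiffusionXLPipeline",
}

def load_from_safetensors(path: str, pipeline_class: str) -> str:
    """Mimic the dispatch: classes without from_single_file support
    cannot be loaded from a monolithic checkpoint file."""
    if pipeline_class not in SINGLE_FILE_CLASSES:
        raise ValueError(f"Unhandled pipeline class: {pipeline_class}")
    return f"loaded {path} as {pipeline_class}"  # placeholder for the real load
```

With the autodetect guess from the log, this raises the same "Unhandled pipeline class" failure.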

Version Platform Description

No response

Relevant log output

Starting SD.Next                                                                                                                                                                                            
Logger: file="/[...]/sdnext.log" level=DEBUG size=64 mode=create                                                                                                     
Python 3.10.12 on Linux                                                                                                                                                                                     
Version: app=sd.next updated=2023-11-25 hash=f4f3a082 url=https://github.com/vladmandic/automatic.git/tree/dev                                                                                          
Latest published version: 8136723f420006ddf8aa02c52f4ab7e98495557b 2023-11-27T08:19:15Z                                                                                                                     
Platform: arch=x86_64 cpu=x86_64 system=Linux release=6.3.13-060313-generic python=3.10.12                                                                                                                  
Setting environment tuning                                                                                                                                                                                  
Cache folder: /[...]/.cache/huggingface/hub                                                                                                                                                        
Torch overrides: cuda=False rocm=False ipex=False diml=False openvino=False                                                                                                                                 
Torch allowed: cuda=True rocm=True ipex=True diml=True openvino=True                                                                                                                                        
Installing package: onnxruntime                                                                                                                                                                             
Running pip: install --upgrade onnxruntime                                                                                                                                                                  
nVidia CUDA toolkit detected: nvidia-smi present 
Command line args: ['--reinstall', '--debug', '--port', '7860', '--no-hashing', '--insecure', '--listen', '--no-metadata'] insecure=True listen=True no_hashing=True no_metadata=True debug=True reinstall=True                                                                                                                                                                                              
Load packages: torch=2.1.1+cu121 diffusers=0.23.1 gradio=3.43.2                                                                                                                                             
Read: file="config.json" json=43 bytes=1807                                                                                                                                                                 
Unknown settings: ['ad_max_models', 'ad_save_previews', 'ad_save_images_before', 'dp_ignore_whitespace', 'multiple_tqdm']                                                                                   
Engine: backend=Backend.DIFFUSERS compute=cuda mode=no_grad device=cuda cross-optimization="Scaled-Dot-Product"                                                                                             
Device: device=NVIDIA GeForce RTX 4070 Ti n=1 arch=sm_90 cap=(8, 9) cuda=12.1 cudnn=8902 driver=535.104.05

Backend

Diffusers

Branch

Dev

Model

LCM

Acknowledgements

vladmandic commented 9 months ago

Enter valid title.

midcoastal commented 9 months ago

"Enter valid title."

My bad.

vladmandic commented 9 months ago

how was this model created? the original lcm-dreamshaper model can only be loaded in diffusers format, not safetensors. all others should either be standard sd models with the lcm lora loaded on top, or with that same lora fused into the model. and for those models, there is no need to trigger "special" lcm loader behavior.

if the model somehow started from the original lcm-dreamshaper, then yes, it would need a special loading handler that doesn't exist right now - in which case, this would be a feature request (and i'd need to discuss it with the diffusers team as it needs to be implemented upstream)

midcoastal commented 9 months ago

Ok. I guess where the confusion comes in is that we explicitly detect "LCM" in the model name and assign the Latent Consistency Model guess, which then causes the error when the load is attempted. The curious part is that when LCM is detected without backend.DIFFUSERS, we actually throw an error, and otherwise we return the guess that leads to this class load. Hence my confusion...

I do not know how the model was created, only that it is an LCM model, and I can confirm that it is not behaving as expected when loaded as a Stable Diffusion Model, or as an XL model. So I am left to believe that it does indeed need to be loaded as LCM.

Diffusers does support LCM models, for both T2I and I2I, just not loading them this particular way (using from_single_file). I found this by searching for the class LatentConsistencyModelPipeline (which is what our own pipeline resolution returns).

The diffusers library has its own "AutoPipeline" implementation in diffusers.pipelines.auto_pipeline, in the classes AutoPipelineForImage2Image and AutoPipelineForText2Image. I haven't played with this at all yet, but from a quick look, it seems like we should be using from_pretrained rather than from_single_file. I am not certain, though.
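A hedged sketch of the AutoPipeline route mentioned above: from_pretrained resolves the concrete pipeline class from a repo's model_index.json, which is why it only works on diffusers-style repos/directories, not on a single .safetensors file. The model id here is an assumption for illustration, and running this downloads several GB:

```python
def load_with_autopipeline(model_id: str = "SimianLuo/LCM_Dreamshaper_v7"):
    """Sketch: let AutoPipelineForText2Image pick the pipeline class
    (LatentConsistencyModelPipeline for this repo) from model_index.json."""
    from diffusers import AutoPipelineForText2Image  # heavy dep, kept local
    return AutoPipelineForText2Image.from_pretrained(model_id)

if __name__ == "__main__":
    pipe = load_with_autopipeline()
    image = pipe("a photo of a cat", num_inference_steps=4, guidance_scale=1.0).images[0]
```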

midcoastal commented 9 months ago

Looking through the code some more, I do see that from_pretrained is actually loading a diffusers-style directory. Interesting...

I have several models from CivitAI that are marked LCM. Curiously, none of them would trigger the LCM check, as they do not follow the LCM[_-] naming convention, so they get loaded as standard StableDiffusion models, which I guess, as you said, checks out?
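A rough illustration of the kind of filename heuristic being discussed (the actual pattern and matching code in SD.Next may differ; this is only a guess at the LCM[_-] convention):

```python
import re

# "lcm" must stand alone, delimited by start/end or _/- in the stem,
# e.g. "modelname_lcm" matches but "dreamshaperLCM8" does not.
LCM_NAME_RE = re.compile(r"(?:^|[_-])lcm(?:[_-]|$)", re.IGNORECASE)

def looks_like_lcm(filename: str) -> bool:
    """Return True if a checkpoint filename follows the LCM[_-] naming
    convention discussed above."""
    stem = filename.rsplit("/", 1)[-1]
    stem = stem.removesuffix(".safetensors").removesuffix(".ckpt")
    return bool(LCM_NAME_RE.search(stem))
```

Under this heuristic, CivitAI models that merely mention LCM somewhere in the name would fall through to the standard Stable Diffusion load path, as described above.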

vladmandic commented 9 months ago

yes, sdnext is using from_pretrained for diffusers-style models - that's what it's for. for safetensors, there is only from_single_file - and it's implemented for some classes, but not for all. i've been working closely with the diffusers team on gradually adding support for more and more (e.g. if you remember, when sdxl first came out, sdnext didn't support loading it from safetensors).

regarding lcm - if there was only one type of lcm model, i would have pushed to add from_single_file to that class a long time ago. but as it is, that applies only to the original lcm, and that is 1% of usage; the rest is a normal sd model with the lcm lora either loaded or merged on top of it. and for that type of lcm, you load it as a normal sd model.

as it is, i might disable the name check for lcm safetensors models, as it doesn't really serve a purpose in the current state.
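The "normal sd model with the lcm lora on top" case can be sketched as follows (a hedged sketch, not SD.Next's loader: the model ids are illustrative, and running it requires a GPU and multi-GB downloads):

```python
def load_sd_with_lcm_lora(base: str = "runwayml/stable-diffusion-v1-5",
                          lora: str = "latent-consistency/lcm-lora-sdv1-5"):
    """Sketch: standard SD checkpoint + LCM-LoRA, per the comment above."""
    from diffusers import StableDiffusionPipeline, LCMScheduler  # heavy deps
    pipe = StableDiffusionPipeline.from_pretrained(base)
    # LCM needs its own scheduler for few-step sampling to work.
    pipe.scheduler = LCMScheduler.from_config(pipe.scheduler.config)
    pipe.load_lora_weights(lora)
    pipe.fuse_lora()  # optional: merge the lora into the base weights
    return pipe

if __name__ == "__main__":
    pipe = load_sd_with_lcm_lora()
    image = pipe("a cat", num_inference_steps=4, guidance_scale=1.0).images[0]
```

Fusing the lora is what produces the "merged" checkpoints circulating on CivitAI, which is why they load fine as standard SD models.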

midcoastal commented 9 months ago

@vladmandic - see #2569

vladmandic commented 9 months ago

explicit load for lcm is removed as not needed.