How can I load the whole model from the compiled one, instead of loading only the UNet into an existing SD model?

chengzeyi / stable-fast

Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.

MIT License

1.05k stars 59 forks

Open quocanh34 opened 4 months ago