Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
How can I load the whole model from the compiled one, instead of loading only the UNet into an existing SD model? #130
Open
quocanh34 opened 4 months ago