AlexKoff88 / stable_diffusion_quantization

Quantization of Stable Diffusion POC
Apache License 2.0

Model size isn't reducing #1

Open sravanthOppo27 opened 1 year ago

sravanthOppo27 commented 1 year ago

Dear Author, I have tried your code, but there isn't any model size reduction for the UNet part. Is this only an inference speed-up mechanism, or does it also reduce the model size? Am I doing anything wrong?

AlexKoff88 commented 1 year ago

Hi,

Thanks for the interest. The reduction happens when you convert the model to the OpenVINO representation.

You can compare for yourself:

sravanthOppo27 commented 1 year ago

Okay. Is your code a direct implementation of OpenVINO, or is it better than OpenVINO (meaning a smaller model size and faster inference)?

AlexKoff88 commented 1 year ago

Actually, the training in the script is done using PyTorch. After the training process finishes, the model is exported to the OpenVINO representation (.xml and .bin files), and that is when the model size is reduced.
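
For anyone who wants to check this on their own files, here is a minimal sketch that compares on-disk sizes. It uses only the Python standard library. The file names (`unet_fp32.pth`, `unet_int8.xml`, `unet_int8.bin`) are hypothetical placeholders, not paths this repository's script is known to produce, so substitute whatever your run actually writes.

```python
import os


def total_size_mb(paths):
    """Return the combined on-disk size of the given files in MiB."""
    return sum(os.path.getsize(p) for p in paths) / (1024 ** 2)


def compression_ratio(original_paths, converted_paths):
    """Return how many times smaller the converted files are than the originals."""
    return total_size_mb(original_paths) / total_size_mb(converted_paths)


if __name__ == "__main__":
    # Hypothetical file names (assumptions); replace with your own outputs.
    pytorch_files = ["unet_fp32.pth"]
    openvino_files = ["unet_int8.xml", "unet_int8.bin"]

    # Only report when every file is present, so the sketch runs harmlessly elsewhere.
    if all(os.path.exists(p) for p in pytorch_files + openvino_files):
        print(f"PyTorch checkpoint: {total_size_mb(pytorch_files):.1f} MiB")
        print(f"OpenVINO IR:        {total_size_mb(openvino_files):.1f} MiB")
        print(f"Ratio:              {compression_ratio(pytorch_files, openvino_files):.2f}x")
    else:
        print("Files not found; edit the paths above to point at your models.")
```

The .xml file holds the network description and the .bin file holds the weights, so both are summed for the OpenVINO side. As a rough guide, weights stored as 8-bit integers take a quarter of the space of 32-bit floats, so a ratio approaching 4x would be the upper bound for a fully quantized model.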