Open pushkarjain1009 opened 1 month ago
I applied dynamic quantization to both TFLite models: the diffusion model and the text_encoder model. However, I ran into difficulties with the diffusion model because of its large size and couldn't find a suitable way to quantize it with the ONNX library at the time. The inference time of the text_encoder model also did not improve significantly with the INT8 ONNX model, so I kept the TFLite version for simplicity.
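To make clear what dynamic-range (weights-only INT8) quantization actually does to a tensor, here is a minimal NumPy sketch of per-tensor symmetric quantization; the array and function names are illustrative, not taken from the notebook:

```python
import numpy as np

def quantize_int8(w):
    # Per-tensor symmetric scale: map max|w| onto 127, the edge of the int8 range.
    scale = np.max(np.abs(w)) / 127.0
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    # At inference time the runtime multiplies back by the stored scale.
    return q.astype(np.float32) * scale

rng = np.random.default_rng(0)
w = rng.normal(size=(4, 8)).astype(np.float32)  # stand-in for a weight tensor
q, scale = quantize_int8(w)
w_hat = dequantize(q, scale)
print("max abs error:", np.max(np.abs(w - w_hat)))
```

The round-trip error is bounded by half the scale step, which is why weights-only INT8 usually costs little accuracy while cutting model size roughly 4x.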
Attached is the notebook I used for converting and quantizing these models in this project.
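For quick reference, the two one-call APIs involved look roughly like this; the paths and output filenames are placeholders, and the exact converter flags used in the attached notebook may differ:

```python
def quantize_tflite_dynamic(saved_model_dir, out_path="model_int8.tflite"):
    """Dynamic-range quantization with the TFLite converter (paths are placeholders)."""
    import tensorflow as tf  # imported lazily so the sketch stands alone
    converter = tf.lite.TFLiteConverter.from_saved_model(saved_model_dir)
    # Optimize.DEFAULT enables dynamic-range (weights-only INT8) quantization.
    converter.optimizations = [tf.lite.Optimize.DEFAULT]
    tflite_model = converter.convert()
    with open(out_path, "wb") as f:
        f.write(tflite_model)
    return out_path

def quantize_onnx_dynamic(model_path, out_path="model_int8.onnx"):
    """Weights-only INT8 quantization via onnxruntime (paths are placeholders)."""
    from onnxruntime.quantization import quantize_dynamic, QuantType
    quantize_dynamic(model_path, out_path, weight_type=QuantType.QInt8)
    return out_path
```

Neither call needs a calibration dataset, which is what makes dynamic quantization the simplest post-training option for large models like the diffusion UNet.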
Hey, can you please elaborate on the quantisation method you used here for SD-1.4? I am trying to implement a similar project but am stuck with the quantisation process. I presume you used INT8 quantisation for deployment on a mobile device. How did you achieve that in both the TFLite and ONNX formats? Can you please help me with that?