Open AlexMa0 opened 4 months ago
autosmoothquant是只支持int8的量化吗?是否可以支持int4的量化?
You can use SmoothQuant to implement w4a8 quantization, but this may result in a non-negligible loss of model performance. If you are interested in performing w4a8 quantization for inference, you can refer to our new project QQQ.
autosmoothquant是只支持int8的量化吗?是否可以支持int4的量化?