Closed tahmiddialpad closed 11 months ago
Hello, @tahmiddialpad,
Thank you for interest to SpQR compression.
When the quantize() function is called in line 217 it quantizes and then immediately dequantizes the model weights. See lines 178-189 in spqr_engine.py(). The line that you mentioned would return quantizers necessary to save the compressed model. This saving functionality is not yet implemented.
I would like to know how the quantized model's performance is evaluated while keeping this line blank?
How are the quantized weights being considered while evaluating the performance of the model since the value of quantizers here is empty?