Closed WilliamZhaoz closed 5 years ago
Hi @WilliamZhaoz , thanks for your questions.
-Brian
It seems in TensorFlow, quantization can afford speed ups on CPU (https://www.tensorflow.org/lite/performance/post_training_quantization). I have also seen this in academic literature.
Are there any plans to improve the inference of quantized models on CPU to afford such speedups?
Cheers Liam
Hi,