Closed kdh0102 closed 3 years ago
For more interesting experiments, models available on DSP, GPU are required.
As DSP needs INT8 quantized models (in case of GPU, INT8/FLOAT16), we need quantized tflite models:
tflite
mediapipe
~Succeeded in making INT8 quantized YOLOv4-tiny.tflite model.~ Quantization fails due to the minimum filter size in a conv. layer.
But,
SPLIT
Even though quantization is done, the lack of op coverage can be a big obstacle.
Refer to our Google Drive for available models. Maybe we can create a .tflite model zoo Github repo later.
.tflite
For more interesting experiments, models available on DSP, GPU are required.
As DSP needs INT8 quantized models (in case of GPU, INT8/FLOAT16), we need quantized
tflite
models:mediapipe
repoNote
~Succeeded in making INT8 quantized YOLOv4-tiny.tflite model.~ Quantization fails due to the minimum filter size in a conv. layer.
But,
SPLIT
ops fall back to CPU, creating 4 GPU delegate ops.Even though quantization is done, the lack of op coverage can be a big obstacle.