Deci-AI / super-gradients

Easily train or fine-tune SOTA computer vision models with one open source training library. The home of Yolo-NAS.
https://www.supergradients.com
Apache License 2.0

QAT. How to check that the model is successfully quantized? #1784

Open Bananaspirit opened 7 months ago

Bananaspirit commented 7 months ago

💡 Your Question

I fine-tuned the model with quantization-aware training and got an ONNX model with Q/DQ layers as output. However, when I use TensorRT to convert the file to an engine with int8 precision, I get the following message:

Calibrator won't be used in explicit precision mode. Use quantization aware training to generate network with Quantize/Dequantize nodes

The command I used to build the engine:

trtexec --onnx=./yolox_s_tree_pole_16x3x640x640_qat.onnx --int8 --saveEngine=./yolox_s_tree_pole_16x3x640x640_qat.engine
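For context: with explicit Q/DQ nodes, TensorRT builds the network in explicit-quantization mode and skips PTQ calibration by design, so this warning alone does not mean quantization failed. What is worth verifying is whether the layers actually execute in int8. One way to do that, assuming a recent TensorRT release where trtexec supports the layer-info flags (the output file name is illustrative):

trtexec --onnx=./yolox_s_tree_pole_16x3x640x640_qat.onnx --int8 --profilingVerbosity=detailed --dumpLayerInfo --exportLayerInfo=./layer_info.json --saveEngine=./yolox_s_tree_pole_16x3x640x640_qat.engine

The exported layer info lists the precision each layer runs in, so Int8 entries confirm int8 execution.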

Questions:

  1. Is this warning accurate? Does it mean the engine was not actually built in int8?
  2. Is there anything you can recommend to verify that the model really is quantized to int8 precision? (See the sketch after this list.)
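
One way to confirm that the exported ONNX file really contains explicit quantization is to count the Q/DQ nodes in the graph. A minimal sketch, assuming the onnx Python package and the file name from the command above:

import onnx

# Load the QAT export and tally its quantization nodes.
model = onnx.load("./yolox_s_tree_pole_16x3x640x640_qat.onnx")
ops = [node.op_type for node in model.graph.node]
print("QuantizeLinear nodes:", ops.count("QuantizeLinear"))
print("DequantizeLinear nodes:", ops.count("DequantizeLinear"))
# A QAT export should contain many of each; zero means no explicit quantization.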

Versions

No response

ranjitkathiriya544 commented 1 month ago

1244, any update on this issue? It's still the same.

[W] [TRT] Calibrator won't be used in explicit precision mode. Use quantization aware training to generate network with Quantize/Dequantize nodes.

I used this command:

trtexec --onnx=./yolo_pt_model.onnx --int8 --saveEngine=./res.trt
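
An already-built engine can also be inspected from Python. A minimal sketch, assuming TensorRT >= 8.2 and an engine built with --profilingVerbosity=detailed (otherwise the per-layer detail is limited); the file name matches the command above:

import tensorrt as trt

logger = trt.Logger(trt.Logger.WARNING)
runtime = trt.Runtime(logger)
with open("./res.trt", "rb") as f:
    engine = runtime.deserialize_cuda_engine(f.read())

# EngineInspector reports per-layer details, including the precision each layer runs in.
inspector = engine.create_engine_inspector()
print(inspector.get_engine_information(trt.LayerInformationFormat.JSON))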