Open mengniwang95 opened 2 years ago
I cannot upload the model file, and I am not sure why.
qdq.zip: this is the QDQ model.
@jywu-msft, does openvino support QDQ natively?
It is supported, but it may not be optimized for this particular model. I will raise this with Intel.
@sfatimar, would you be able to take a look at the qdq model?
How was this QDQ model generated? If you want good performance, you can use NNCF by Intel to generate a QDQ model that performs well: https://github.com/openvinotoolkit/nncf. We can help you use the Post Training Quantization or QAT (Quantization Aware Training) features of NNCF.
I got an fp32 ONNX model first, and then generated the QDQ model from it. How does NNCF generate a QDQ model?
Please go through this tool, which is currently in development: https://github.com/openvinotoolkit/nncf/tree/develop/examples/experimental/onnx
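For readers new to the format: a QDQ model keeps its fp32 graph but wraps tensors in QuantizeLinear/DequantizeLinear node pairs. The sketch below illustrates the arithmetic of one such pair on a single scalar, assuming uint8 with a per-tensor scale and zero point. The function names and the example values are illustrative only; the thread does not say which tool or parameters produced qdq.zip.

```python
def quantize_linear(x, scale, zero_point, qmin=0, qmax=255):
    """Map a float to an integer code, as an ONNX QuantizeLinear node does.

    Python's round() rounds halves to the nearest even integer, which
    matches the rounding mode in the ONNX operator definition.
    """
    q = round(x / scale) + zero_point
    # Saturate to the representable range (uint8 assumed here).
    return max(qmin, min(qmax, q))


def dequantize_linear(q, scale, zero_point):
    """Map an integer code back to a float, as DequantizeLinear does."""
    return (q - zero_point) * scale


# Illustrative values, not taken from the model in this thread.
scale, zero_point = 0.5, 128
code = quantize_linear(0.3, scale, zero_point)
restored = dequantize_linear(code, scale, zero_point)
```

The round trip loses precision (0.3 does not come back exactly), and a backend only gains speed if it fuses these pairs into real int8 kernels rather than executing them as extra fp32 operations.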
Describe the bug
Hi, I used the OpenVINO EP to test a QDQ model's performance and found that it is worse than the original fp32 model's.
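A comparison like this is easier to reproduce with a small timing harness. The following is a minimal sketch, not the reporter's actual script: `time_calls` and `make_runner` are hypothetical helper names, the file names in the usage comment are placeholders, and `make_runner` assumes an onnxruntime build with the OpenVINO execution provider enabled.

```python
import statistics
import time


def time_calls(run_once, warmup=5, repeats=50):
    """Return the median latency of run_once() in milliseconds."""
    # Warm-up runs are excluded so one-time setup costs do not skew the result.
    for _ in range(warmup):
        run_once()
    samples = []
    for _ in range(repeats):
        start = time.perf_counter()
        run_once()
        samples.append((time.perf_counter() - start) * 1000.0)
    return statistics.median(samples)


def make_runner(model_path, feeds):
    """Build a zero-argument callable that runs one inference.

    Assumption: onnxruntime is installed with OpenVINO EP support.
    The import is lazy so the timing helper works without it.
    """
    import onnxruntime as ort

    session = ort.InferenceSession(
        model_path,
        providers=["OpenVINOExecutionProvider", "CPUExecutionProvider"],
    )
    return lambda: session.run(None, feeds)


# Usage (placeholder file names; feeds maps input names to numpy arrays):
# fp32_ms = time_calls(make_runner("model_fp32.onnx", feeds))
# qdq_ms = time_calls(make_runner("model_qdq.onnx", feeds))
```

Reporting the median over many runs after a warm-up makes the fp32 and QDQ numbers less sensitive to first-run compilation and to outliers.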
System information