CPU Onnxruntime returns incorrect result for UINT8 quantized model (contains just 1 matmul shape(1,4) @ shape(4,1)) with the following env:
onnx==1.14onnxruntime==1.16protobuf==4.24.4
Passing on CPU: AMD Ryzen 9 7900X 12-Core Processor ; correct output is 0.22868575
Failing on CPU: AMD Ryzen Threadripper 2950X 16-Core Processor; incorrect output is -0.44277453
This issue has been automatically marked as stale due to inactivity and will be closed in 30 days if no further activity occurs. If further support is needed, please provide an update and/or more details.
Describe the issue
CPU Onnxruntime returns incorrect result for UINT8 quantized model (contains just 1 matmul
shape(1,4) @ shape(4,1)
) with the following env:onnx==1.14
onnxruntime==1.16
protobuf==4.24.4
Passing on CPU: AMD Ryzen 9 7900X 12-Core Processor ; correct output is 0.22868575 Failing on CPU: AMD Ryzen Threadripper 2950X 16-Core Processor; incorrect output is -0.44277453
To reproduce
Onnx file: mm_no_bias_uint8.tar.gz
Script to repro:
Urgency
Customer release is blocked by this issue.
Platform
Linux
OS Version
20.04.6 LTS (Focal Fossa)
ONNX Runtime Installation
Released Package
ONNX Runtime Version or Commit ID
1.16
ONNX Runtime API
Python
Architecture
X86
Execution Provider
Default CPU
Execution Provider Library Version
No response