Whenever I run inference on the quantized model in Java using the oneDNN execution provider, I get the error below.
2024-04-05 12:37:23.831441312 [E:onnxruntime:, sequential_executor.cc:516 ExecuteKernel] Non-zero status code returned while running DNNL_9692988425953928956_1 node. Name:'DnnlExecutionProvider_DNNL_9692988425953928956_1_1' Status Message: /onnxruntime/onnxruntime/core/providers/dnnl/subgraph/dnnl_dequantizelinear.cc:191 void onnxruntime::ort_dnnl::DnnlDequantizeLinear::ValidateDims(onnxruntime::ort_dnnl::DnnlSubgraphPrimitive&, onnxruntime::ort_dnnl::DnnlNode&) x_scale and x_zero_point dimensions does not match
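For context, the message comes from a shape check on the DequantizeLinear inputs: the ONNX operator computes y = (x - x_zero_point) * x_scale and expects x_scale and x_zero_point to have the same shape. Below is a minimal sketch of that rule in plain Java. The class and method names are hypothetical and are not part of ONNX Runtime, and the actual oneDNN provider code may apply different rules; the scalar versus one-element mismatch in the usage note is only an illustration, not a confirmed cause of this failure.

```java
import java.util.Arrays;

/** Hypothetical helper for illustration; not part of ONNX Runtime. */
final class DequantizeSketch {

    /** Throws if the scale and zero-point shapes differ, as the log message describes. */
    static void validateDims(long[] scaleShape, long[] zeroPointShape) {
        if (!Arrays.equals(scaleShape, zeroPointShape)) {
            throw new IllegalArgumentException(
                "x_scale and x_zero_point dimensions do not match: "
                    + Arrays.toString(scaleShape) + " vs " + Arrays.toString(zeroPointShape));
        }
    }

    /** Per-tensor int8 dequantization: y = (x - zeroPoint) * scale. */
    static float[] dequantize(byte[] x, float scale, byte zeroPoint) {
        float[] y = new float[x.length];
        for (int i = 0; i < x.length; i++) {
            // byte operands are promoted to int before the subtraction
            y[i] = (x[i] - zeroPoint) * scale;
        }
        return y;
    }
}
```

With this sketch, validateDims(new long[] {}, new long[] {}) passes for two scalars, while validateDims(new long[] {}, new long[] {1}) throws because a scalar and a one-element tensor have different shapes.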
Note that when I remove options.addDnnl(true); from the session options, the same model and script work fine. The original (non-quantized) ONNX model also runs without errors.
The issue occurs when I run inference with different inputs. For example, if I send the input "test" in the first inference call, I receive the corresponding vector. However, on the second call, any input other than "test" produces the error above.
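To narrow down which inputs trigger the failure, the calls can be driven through a small harness that runs several inputs in order and records the ones that throw. This is only a sketch: ReproHarness and the embed function are hypothetical stand-ins, and in a real run embed would wrap the tokenizer and the session call on a session created with options.addDnnl(true), rethrowing the checked OrtException as a RuntimeException.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Function;

/** Hypothetical harness; embed stands in for the real tokenizer + session call. */
final class ReproHarness {

    /** Runs every input through embed in order and returns the inputs that threw. */
    static List<String> failingInputs(List<String> inputs, Function<String, float[]> embed) {
        List<String> failed = new ArrayList<>();
        for (String text : inputs) {
            try {
                embed.apply(text);
            } catch (RuntimeException e) {
                // Keep going so that every failing input is reported, not just the first.
                failed.add(text);
            }
        }
        return failed;
    }
}
```

Running it on a sequence such as "test", "hello", "test", "world" would show whether "test" keeps working after another input has failed, which helps separate an input-dependent problem from a session left in a bad state.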
Describe the issue
I quantized the sentence-transformers/paraphrase-multilingual-MiniLM-L12-v2 model using the script below.
I built ONNX Runtime from source using the command below.
To reproduce
Please find the models and Jar here
Urgency
No response
Platform
Linux
OS Version
Ubuntu 22.04.3
ONNX Runtime Installation
Built from Source
ONNX Runtime Version or Commit ID
1.18.0
ONNX Runtime API
Java
Architecture
X64
Execution Provider
oneDNN
Execution Provider Library Version
No response