Open Novelfor opened 2 months ago
Hi, @Novelfor did you found any rectification?.. for the qnn model run on the htp the same issue iam also facing runs on cpu not htp ` 0.0ms [ ERROR ] graph_prepare.cc:203:ERROR:could not create op: q::flat_from_vtcm
0.0ms [ ERROR ] graph_prepare.cc:1187:ERROR:Op 0x1a47e400000026 preparation failed with err:-1
0.0ms [ ERROR ] QnnDsp <E> "_encoder_backbone_backbone_0_Conv" generated: could not create op
0.0ms [ ERROR ] QnnDsp <E> RouterX86 graph prepare failed 12
0.0ms [ ERROR ] QnnDsp <E> Failed to finalize graph (id: 1) with err 1002
0.0ms [VERBOSE] QnnDsp <V> Wake up free backend 1 thread(s)
0.0ms [ INFO ] QnnDsp <I> QnnGraph_finalize done. status 0x3ea
43309.1ms [ ERROR ] Finalize Graph for Idx = 0 failed with error = 1002
Graph Finalize failure
0.0ms [VERBOSE] QnnDsp
0.0ms [WARNING] QnnDsp <W> Backend 1 free cleanup called during process exit
0.0ms [VERBOSE] QnnDsp <V> Terminated backend 0x1 successfully in backendLifecycleManager
0.0ms [VERBOSE] QnnDsp <V> Final context cleanup: contextId = 1!
0.0ms [VERBOSE] QnnDsp <V> qnnOpPackageManager: unloading OpPackages...
0.0ms [VERBOSE] QnnDsp <V> qnnOpPackageManager: OpPackge already unloaded.
0.0ms [VERBOSE] QnnDsp <V> RouterNative tryUnLoadPrepare Disabled`
I try use aimet to quantize transformer... it works https://github.com/quic/aimet
Describe the issue
I train a qat self attention model by Pytorch FX, the model can be run in libQnnCpu.so but error in libQnnHtp.so. The model run in linux x86. QNN: 2.20.0.240223 ERROR Message:
To reproduce
I write the minimal reproduce code, the pytorch code to generate "test_int8.onnx", and use c++ code to run it.
I only test it in linux x86, but i guess it will be consistent on the Android side.
Init this model in c++
Urgency
No response
Platform
Android
OS Version
linux
ONNX Runtime Installation
Built from Source
Compiler Version (if 'Built from Source')
gcc9
Package Name (if 'Released Package')
None
ONNX Runtime Version or Commit ID
8c2689877fb48bfb4a6a133b020cdb5ec7c9b066
ONNX Runtime API
C++/C
Architecture
X64
Execution Provider
Other / Unknown
Execution Provider Library Version
QNN: 2.20.0.240223