Hi, I am following these instructions to compile a Llama model (LLM360/Amber and meta-llama/Llama-2-7b-chat-hf) on an A100 machine:
https://github.com/quic/cloud-ai-sdk/tree/1.12/models/language_processing/decoder/LlamaForCausalLM
I hit the same error with both models after converting them to ONNX and trying to compile them to QAIC:
QAIC_ERROR:
Error message: [Operator-'/model/layers.0/self_attn/ScatterND', opset_version-13, ir_version-7] : Indices and updates must have same lengths!
QAICException:Unable to AddNodesToGraphFromModel
When I use compileModel.sh, I get the following error:
bash compileModel.sh Amber-kv mx6 14
Invalid option -retained-state. Use -h, -help, or --help for list of options.
Because of this, I compile the model without the -retained-state option. This is my compile command:
This is my environment:
What can be causing this error?
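For reference, the ONNX specification requires the ScatterND inputs to satisfy updates.shape == indices.shape[:-1] + data.shape[indices.shape[-1]:], so the error may point to a node in the exported graph where this relation does not hold. Below is a minimal sketch of that check in plain Python. The function name and the example shapes are hypothetical and are not taken from my exported model, and whether the QAIC compiler applies exactly this rule is an assumption on my part.

```python
def scatternd_shapes_ok(data_shape, indices_shape, updates_shape):
    """Return True if the shapes satisfy the ONNX ScatterND rule:
    updates.shape == indices.shape[:-1] + data.shape[indices.shape[-1]:]
    (hypothetical helper, not part of the Cloud AI SDK).
    """
    data_shape = tuple(data_shape)
    indices_shape = tuple(indices_shape)
    updates_shape = tuple(updates_shape)

    # indices must have at least one dimension; its last dimension is the
    # number of leading data axes addressed by each index tuple.
    if not indices_shape:
        return False
    k = indices_shape[-1]
    if k > len(data_shape):
        return False

    expected = indices_shape[:-1] + data_shape[k:]
    return updates_shape == expected


# Illustrative shapes only (assumptions, not from the real model).
print(scatternd_shapes_ok((8,), (4, 1), (4,)))            # consistent shapes
print(scatternd_shapes_ok((4, 4, 4), (2, 1), (2, 4, 4)))  # consistent shapes
print(scatternd_shapes_ok((8,), (4, 1), (3,)))            # mismatched updates
```

Feeding it the static shapes of the inputs to /model/layers.0/self_attn/ScatterND (for example as shown by a model viewer) would show whether the exported node itself is inconsistent or whether the compiler is stricter than the ONNX rule.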