Closed qz233 closed 5 months ago
Hi @qz233,
You can find the list of NPU-supported ops here. Your MLP and the model with the Conv2d -> flatten -> Linear pattern should work. The provider config file (vaip_config.json) currently has a field named "minimum_num_of_conv", which sets the minimum number of conv layers a model must have to be offloaded to the NPU. The default value is 2. Since your models have fewer than 2 conv layers, you see the "not a supported CNN model" message. You can edit the config file to lower this value, and your models will then run on the NPU.
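For anyone who would rather script the config change than edit it by hand, here is a minimal sketch. It assumes only that vaip_config.json is plain JSON and that the "minimum_num_of_conv" key appears somewhere inside it. The exact nesting is not described in this thread, so the helper searches recursively instead of assuming a path. The function names and the `.bak` backup are illustrative choices, not part of any AMD tool.

```python
import json
from pathlib import Path


def set_key_everywhere(node, key, value):
    """Recursively set `key` to `value` wherever it appears; return the hit count."""
    hits = 0
    if isinstance(node, dict):
        for k in node:
            if k == key:
                node[k] = value  # overwrite in place; dict size is unchanged
                hits += 1
            else:
                hits += set_key_everywhere(node[k], key, value)
    elif isinstance(node, list):
        for item in node:
            hits += set_key_everywhere(item, key, value)
    return hits


def lower_conv_threshold(config_path, new_value=1):
    """Set every "minimum_num_of_conv" entry in the config to `new_value`.

    Hypothetical helper: writes a .bak copy of the original file first,
    and raises KeyError if the key is not present anywhere.
    """
    path = Path(config_path)
    original = path.read_text(encoding="utf-8")
    config = json.loads(original)

    hits = set_key_everywhere(config, "minimum_num_of_conv", new_value)
    if hits == 0:
        raise KeyError(f"minimum_num_of_conv not found in {path}")

    # Keep the untouched original next to the edited file.
    path.with_name(path.name + ".bak").write_text(original, encoding="utf-8")
    path.write_text(json.dumps(config, indent=4), encoding="utf-8")
    return hits


# Example (path is an assumption; point it at your own copy of the file):
# lower_conv_threshold("vaip_config.json", new_value=1)
```

Delete the EP cache after changing the config so the model is recompiled with the new threshold.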
Hi @savitha-srinivasan
Thank you very much! I will try again once I return home. May I also ask whether conv1d is supported by the VitisAIExecutionProvider? Since my Ryzen AI project works on audio, there are quite a lot of these operators.
Did you get it working, @savitha-srinivasan? I'm having similar problems.
@qz233 @rhenry74 we do have support for conv1d. We were able to reproduce the issue for your model. Could you try setting XLNX_ENABLE_CONV1D=1 in your environment and retry? This should fix it. Please also delete the cache while retrying.
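A rough sketch of the retry steps from Python, in case it helps: set the variable before the session is created and remove the old cache directory first. The cache location is not given in this thread, so `cache_dir` is a placeholder you must fill in yourself. The `config_file` provider option in `create_session` is an assumption about how the provider is configured on your install, and both function names are illustrative.

```python
import os
import shutil
from pathlib import Path


def prepare_conv1d_run(cache_dir):
    """Enable conv1d support and clear a stale compilation cache.

    `cache_dir` is a placeholder: pass the directory where your install
    stores the Vitis AI EP cache. Returns True if a cache was removed.
    """
    # Must be set before the inference session is created.
    os.environ["XLNX_ENABLE_CONV1D"] = "1"

    cache = Path(cache_dir)
    existed = cache.is_dir()
    if existed:
        shutil.rmtree(cache)  # forces the model to be recompiled
    return existed


def create_session(model_path, config_file):
    """Create an ONNX Runtime session on the Vitis AI EP (assumed options)."""
    import onnxruntime as ort  # imported lazily so the helper above has no dependency

    return ort.InferenceSession(
        model_path,
        providers=["VitisAIExecutionProvider"],
        provider_options=[{"config_file": config_file}],
    )


# Example (all paths are assumptions):
# prepare_conv1d_run(r"C:\path\to\your\vitis_ai_cache")
# session = create_session("model.onnx", "vaip_config.json")
```

Setting the variable in the shell before launching Python works just as well; the point is only that it must be in place before the provider compiles the model.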
@savitha-srinivasan Yep, all my problems are solved. Thanks.
Hi team, I am trying to deploy my model on an AMD NPU device using VitisAIExecutionProvider. I thought that all supported operators could be computed on the NPU, but I often encounter this notice:
I20240527 12:49:21.131315 16676 pass_main.cpp:245] [VITIS AI EP] This model is not a supported CNN model which will not be compiled with DPU.
So does this tool only support pure CNN architectures? And how can I use it for other kinds of models (such as Transformers)?
Here are the tests I did:
I20240527 13:09:21.737005 19076 PartitionPass.cpp:6479] xir::Op{name = input_DequantizeLinear_Output_/layer1/Conv_output_0, type = fix2float} is not supported by current target. Target name: AMD_AIE2_Nx4_Overlay, target type: IPU_PHX. Assign it to CPU.
[Vitis AI EP] No. of Operators : CPU 2 IPU 15 88.24%
(which is weird because the ResNet test case has a similar structure)
My test code: