Open jayong-sv opened 2 months ago
Hi @jayong-sv, In the version you are working with, the Clip model is not supported. In the current release (DFC 3.28.0), only the clip_resnet image encoder is supported. More Clip models, including the text encoder, would be supported in the future.
Regards,
@omerwer Hello, The error message I posted was occurred when I converted clip_resnet image encoder. F.multi_head_attention_forward is included in clip_resnet.
I noticed that CLIP is already present in the Hailo Model Zoo, which suggests that conversion is possible. link
I need help converting a model I trained myself. How can I parse CLIP to HAR? After converting a ResNet-based CLIP model to ONNX, I encountered the following error when parsing
torch.nn.functional.multi_head_attention_forward
from ONNX to HAR.Reproduction Code
The code to reproduce the error is as follows.
AttentionPool2d
is taken from the OpenAI CLIP code: linkExecution Environment