Open ziyanxzy opened 1 month ago
@ziyanxzy, at the moment there is no way to support this operation in OpenVINO while preserving dynamic shapes. Having analyzed the model, I can say that this part is also problematic for model tracing in TorchScript: tracing uses constant folding to resolve the loop in the resampler forward, and as a result the model cannot process images whose number of patches or image sizes differ from the shapes used during tracing. I prepared reference code for model conversion and inference in OpenVINO in this notebook: https://github.com/openvinotoolkit/openvino_notebooks/pull/2302
Could you please check?
OpenVINO Version
2024.4.0-16283-41691a36b90
Operating System
Windows System
Device used for inference
CPU
Framework
PyTorch
Model used
https://hf-mirror.com/openbmb/MiniCPM-V-2_6
Issue description
When I try to convert MiniCPM-V 2.6 (image_encoder), the conversion reports: Summary: -- No conversion rule found for operations: aten::pad_sequence -- Conversion is failed for: prim::ListConstruct
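For context, here is a minimal sketch of what `aten::pad_sequence` computes, written with basic tensor ops. The helper name `pad_sequence_manual` and the toy `patches` list are hypothetical, used only for illustration; this is not the model's code, and it is not claimed to be a conversion fix. It mainly shows that the output shape depends on the number and lengths of the input sequences.

```python
import torch
from torch.nn.utils.rnn import pad_sequence


def pad_sequence_manual(sequences, padding_value=0.0):
    """Pad a list of (length_i, *trailing) tensors to (batch, max_len, *trailing).

    Hypothetical helper that mirrors pad_sequence(..., batch_first=True).
    """
    max_len = max(seq.shape[0] for seq in sequences)
    trailing = sequences[0].shape[1:]
    # Allocate the padded output with the dtype/device of the first sequence.
    out = sequences[0].new_full((len(sequences), max_len, *trailing), padding_value)
    for i, seq in enumerate(sequences):
        # Copy each sequence into the leading rows; the rest stays padded.
        out[i, : seq.shape[0]] = seq
    return out


# Toy stand-in for per-image patch sequences of different lengths.
patches = [torch.ones(3, 2), torch.ones(5, 2), torch.ones(1, 2)]
manual = pad_sequence_manual(patches)
reference = pad_sequence(patches, batch_first=True, padding_value=0.0)
```

Note that the Python loop over `sequences` is unrolled when the model is traced, so a rewrite like this would still fix the number of sequences at trace time.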
Step-by-step reproduction
Relevant log output
Issue submission checklist