Open Lavreniuk opened 1 month ago
Hi @Lavreniuk , thank you for filing this issue! Apologies for the delayed response. This is a known issue due to one of package dependencies in AI Hub. We will fix this once resolved in the dependent package. Please join our Slack Community for release notes on when this issue is resolved.
The model inference time is much slower
To Reproduce pip install -q qai-hub-models git clone https://github.com/quic/ai-hub-models cd ai-hub-models/qai_hub_models/models/vit python -m qai_hub_models.models.vit.export
Expected behavior here is expected results: https://aihub.qualcomm.com/jobs/jn5qv9l7g
Stack trace here is my run: https://app.aihub.qualcomm.com/jobs/jmg9drmw5/ (in your example all layers are in NPU, in my not).
Also if I download weights from your repo from this commit: https://huggingface.co/qualcomm/VIT/tree/973054da6d5d65b53537f7e608021bded3c4c522 it works fine, but the newest commit it same issue as when I converted by myself (slow inference time).
Host configuration: