Open gformcreation opened 2 months ago
AMD’s NPU has an implementation in this repository, but its performance is poor. I’ve done some exploration, but I couldn’t even pass the unit tests for basic op, so I believe that support for AMD’s NPU might take a long time, unless AMD deems it worth the effort.
volunteer to test intel AI boost (NPU) here.
AMD's NPU is XDNA, for current state see https://github.com/ggerganov/llama.cpp/issues/1499
For INTEL's NPU this may be the right issue https://github.com/ggerganov/llama.cpp/issues/5079
QUALCOMM's NPU is Hexagon, see https://github.com/ggerganov/llama.cpp/issues/2687
can we support more NPU liked devices like MediaTek APU(AI Processing Unit),its development kit named NeuroPilot SDK,link
any update?
Prerequisites
Feature Description
First, thank you for your incredible work on this project! To enhance its performance, especially on mobile devices and NPU-enabled PCs like those with Copilot+, I would love to see support for Neural Processing Units (NPUs).
Motivation
Integrating NPU support would significantly improve the speed and efficiency of AI tasks, offering users a smoother and more responsive experience in terms of quick response generation.
Possible Implementation
No response