opensearch-project / ml-commons

ml-commons provides a set of common machine learning algorithms, such as k-means and linear regression, to help developers build ML-related features within OpenSearch.
Apache License 2.0
88 stars, 125 forks

[FEATURE] Utilize Apple Neural Engine #2550

Open: rbpasker opened this issue 1 month ago

rbpasker commented 1 month ago

I'm wondering if there is a plan to deploy on the Apple Neural Engine (ANE).

https://machinelearning.apple.com/research/neural-engine-transformers

This year at WWDC 2022, Apple is making available an open-source reference PyTorch implementation of the Transformer architecture, giving developers worldwide a way to seamlessly deploy their state-of-the-art Transformer models on Apple devices. This implementation is specifically optimized for the Apple Neural Engine (ANE), the energy-efficient and high-throughput engine for ML inference on Apple silicon. It will help developers minimize the impact of their ML inference workloads on app memory, app responsiveness, and device battery life. Increasing the adoption of on-device ML deployment will also benefit user privacy, since data for inference workloads remains on-device, not on the server.

dblock commented 3 weeks ago

[Catch All Triage - Attendees 1, 2, 3, 4, 5]

Thanks for opening this.