microsoft / onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
https://onnxruntime.ai
MIT License
14.77k stars 2.94k forks source link

[Feature Request] Add official support for onnxruntime-gpu on ARM64/aarch64 platforms #22903

Open abhishek-iitmadras opened 1 day ago

abhishek-iitmadras commented 1 day ago

Describe the feature request

Issue Description Currently, onnxruntime-gpu package lacks official support for ARM64/aarch64 architecture, limiting GPU acceleration capabilities on increasingly popular ARM-based platforms.

Current Situation

No official pre-built wheels for onnxruntime-gpu on ARM64/aarch64 Limited documentation for ARM64 GPU deployment

Technical Details

Proposed Solution

Official pre-built wheels for ARM64/aarch64 CI/CD pipeline additions for ARM64 builds

Would appreciate any feedback or guidance on how to make this happen?

Describe scenario use case

Use Case : Growing adoption of ARM64 in edge/hpc computing. Cloud deployments on ARM64-based servers (AWS Graviton, etc.) Machine learning workloads on newer ARM-based development machines IoT and embedded systems requiring GPU acceleration

abhishek-iitmadras commented 1 day ago

cc @snnn @skottmckay @edgchen1 @fs-eire @tiantun @hariharans29 @yuslepukhin @mszhanyi @baijumeswani @yufenglee @pengwa @pranavsharma @guoyu-wang @centwang @HectorSVC @natke @adrianlizarraga @wangyems @jchen351 @faxu @RandySheriff @fdwr @jywu-msft @askhade