microsoft / onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
https://onnxruntime.ai
MIT License
14.81k stars 2.94k forks source link

[ARM] MatMulNBits FP16 support - kernels only #22806

Closed fajin-corp closed 1 week ago

fajin-corp commented 2 weeks ago

Description

A break down PR of https://github.com/microsoft/onnxruntime/pull/22651 Add fp16 kernels.

Motivation and Context