intel / onnxruntime

ONNX Runtime: cross-platform, high performance scoring engine for ML models
MIT License

Ericcraw/ort allocator hacking #435

Closed saurabhkale17 closed 2 months ago

saurabhkale17 commented 2 months ago

Description

Avoid memcpy of the input and output tensors for OVEP NPU by using the allocator to allocate their buffers directly on the remote device.
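
To illustrate the idea, here is a minimal, self-contained sketch of why allocating tensors through a device-aware allocator removes the staging copies. It is not the code from this PR and does not use the ONNX Runtime or OpenVINO APIs: `RemoteAllocator`, `Device`, `Owns`, and the `copies` counter are hypothetical names, and the "device" is simulated with ordinary host memory.

```cpp
#include <cassert>
#include <cstddef>
#include <cstring>
#include <memory>
#include <vector>

// Hypothetical stand-in for an allocator that returns device-visible buffers.
// In this sketch the "remote" memory is simulated with plain host memory.
class RemoteAllocator {
 public:
  // Hands out a zero-initialized buffer that the simulated device can use directly.
  float* Alloc(std::size_t count) {
    assert(count > 0);
    pool_.push_back(std::make_unique<float[]>(count));
    return pool_.back().get();
  }

  // Returns true if ptr is the start of a buffer handed out by this allocator.
  bool Owns(const float* ptr) const {
    for (const auto& buffer : pool_) {
      if (buffer.get() == ptr) return true;
    }
    return false;
  }

 private:
  std::vector<std::unique_ptr<float[]>> pool_;
};

// Hypothetical device wrapper that counts how many staging copies it performs.
struct Device {
  RemoteAllocator allocator;
  std::size_t copies = 0;

  // Simulated inference: writes 2 * in[i] to out[i] for n elements.
  void Run(const float* in, float* out, std::size_t n) {
    // Input: stage through a device buffer only if the caller's buffer is not one.
    const float* dev_in = in;
    if (!allocator.Owns(in)) {
      float* staged = allocator.Alloc(n);
      std::memcpy(staged, in, n * sizeof(float));
      ++copies;
      dev_in = staged;
    }

    // Output: write in place when the caller's buffer is already device-visible.
    float* dev_out = allocator.Owns(out) ? out : allocator.Alloc(n);
    for (std::size_t i = 0; i < n; ++i) {
      dev_out[i] = dev_in[i] * 2.0f;
    }

    // Copy back only if a separate device buffer had to be used.
    if (dev_out != out) {
      std::memcpy(out, dev_out, n * sizeof(float));
      ++copies;
    }
  }
};
```

With ordinary host vectors, `Run` performs two copies (one in, one out). When both tensors come from `allocator.Alloc`, it performs none, which is the effect the description above is aiming for.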