microsoft / onnxruntime

ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
https://onnxruntime.ai
MIT License

[Feature Request] SpaceToDepth & DepthToSpace integer implementations #21287

Open · mcollinswisc opened this issue 4 months ago

mcollinswisc commented 4 months ago

Describe the feature request

The DepthToSpace and SpaceToDepth ops support integer types per the ONNX spec:
https://github.com/onnx/onnx/blob/38afbd31ac9a585abb7463dcaaae121651e0a2d7/docs/Operators.md#DepthToSpace
https://github.com/onnx/onnx/blob/38afbd31ac9a585abb7463dcaaae121651e0a2d7/docs/Operators.md#SpaceToDepth

There are currently only implementations for float and double on CPU:
https://github.com/microsoft/onnxruntime/blob/4c3c809bdbcde4ea96f0a31a242ca6877a10c40a/onnxruntime/core/providers/cpu/tensor/space_depth_ops.cc
and for float16 on CUDA:
https://github.com/microsoft/onnxruntime/blob/4c3c809bdbcde4ea96f0a31a242ca6877a10c40a/onnxruntime/core/providers/cuda/tensor/space_depth_ops.cc

Describe scenario use case

Running quantized models, where these ops need to operate directly on uint8/int8 tensors.

skottmckay commented 3 months ago

Should be relatively simple to try out given the implementations are templatized.

You could extend the list of supported types in the type constraints for the latest opset and add a new branch in the Compute method (a rough sketch follows the links below).

https://github.com/microsoft/onnxruntime/blob/4c3c809bdbcde4ea96f0a31a242ca6877a10c40a/onnxruntime/core/providers/cpu/tensor/space_depth_ops.cc#L28-L29
https://github.com/microsoft/onnxruntime/blob/4c3c809bdbcde4ea96f0a31a242ca6877a10c40a/onnxruntime/core/providers/cpu/tensor/space_depth_ops.cc#L53-L54
https://github.com/microsoft/onnxruntime/blob/4c3c809bdbcde4ea96f0a31a242ca6877a10c40a/onnxruntime/core/providers/cpu/tensor/space_depth_ops.cc#L135
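Very roughly, the CPU change might look like the following. This is only a sketch, not the actual file contents: the registration macro, the opset version, and the `SpaceDepthOpCpuImpl` helper name are placeholders for whatever the existing CPU kernel actually uses.

```cpp
// Sketch: extend the "T" type constraint of the latest-opset CPU registration
// with the 8-bit tensor types (macro name and opset number are illustrative).
ONNX_CPU_OPERATOR_KERNEL(
    DepthToSpace,
    13,
    KernelDefBuilder().TypeConstraint("T", {DataTypeImpl::GetTensorType<float>(),
                                            DataTypeImpl::GetTensorType<double>(),
                                            DataTypeImpl::GetTensorType<uint8_t>(),
                                            DataTypeImpl::GetTensorType<int8_t>()}),
    DepthToSpace);

// Sketch: dispatch on the new types in Compute, reusing the existing templated
// implementation (SpaceDepthOpCpuImpl is a stand-in name for that helper).
Status DepthToSpace::Compute(OpKernelContext* context) const {
  const Tensor& input = *context->Input<Tensor>(0);
  if (input.IsDataType<float>()) {
    // existing float path
  } else if (input.IsDataType<double>()) {
    // existing double path
  } else if (input.IsDataType<uint8_t>() || input.IsDataType<int8_t>()) {
    // new 8-bit path; see the note below about sharing one instantiation
  } else {
    return ORT_MAKE_STATUS(ONNXRUNTIME, FAIL, "Unsupported input data type.");
  }
  return Status::OK();
}
```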

Ideally uint8 and int8 would be handled in the same branch given they're the same data size (i.e. we don't want to pay the binary size cost of two implementations moving 8-bit data around).
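Continuing the sketch above, the shared branch could route both 8-bit types through a single uint8_t instantiation of the (hypothetical) templated helper, since the op only rearranges elements and never interprets their values:

```cpp
// Sketch: the op only moves 8-bit elements around, so one uint8_t instantiation
// can serve both int8 and uint8 inputs, keeping a single copy of the code in
// the binary. SpaceDepthOpCpuImpl is a stand-in for the existing helper.
if (input.IsDataType<uint8_t>() || input.IsDataType<int8_t>()) {
  ORT_RETURN_IF_ERROR(SpaceDepthOpCpuImpl<uint8_t>(input, output /*, ...same args as the float path */));
}
```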

The CUDA implementation seems to be pretty generic already and may just need the addition of the data types in the type constraints.
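If so, the CUDA side might only be a registration change along these lines (again a sketch; the exact macro form, opset number, and existing type list should be taken from the file itself):

```cpp
// Sketch: add the 8-bit tensor types to the CUDA EP's "T" type constraint.
ONNX_OPERATOR_KERNEL_EX(
    DepthToSpace,
    kOnnxDomain,
    13,
    kCudaExecutionProvider,
    (*KernelDefBuilder::Create())
        .TypeConstraint("T", {DataTypeImpl::GetTensorType<float>(),
                              DataTypeImpl::GetTensorType<double>(),
                              DataTypeImpl::GetTensorType<MLFloat16>(),
                              DataTypeImpl::GetTensorType<uint8_t>(),   // added
                              DataTypeImpl::GetTensorType<int8_t>()}),  // added
    DepthToSpace);
```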

The kernel registrations in the EPs aren't typed (i.e. the kernel implementation handles the different supported data types internally), so you shouldn't need to do anything there.