ROCm / MIOpen

AMD's Machine Intelligence Library
https://rocm.docs.amd.com/projects/MIOpen/en/latest/

Post #2334 Merge Issue: Reference kernel for 3D convolution for non-packed tensors #2440

Open junliume opened 11 months ago

junliume commented 11 months ago

UPDATE:

In discussion with @JehandadKhan, we decided not to include the strides of non-packed tensors in the database keys. Only an optional flag (indicating that at least one tensor is non-packed) should be included there. This means that non-packed convolutions that differ only in strides will share the same find-db records, the same Invoker instances, and the same perf-db information.

The above design should work correctly provided that:

From now on, the above is becoming a part of the Solver/Solution/Invoker specification.
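
To illustrate the key design described above, here is a minimal sketch. It is not MIOpen code: `TensorDesc`, `IsPacked`, `MakeDbKey`, and the key string format are hypothetical names and formats chosen for illustration, and "packed" is assumed to mean dense strides with the last dimension innermost. The point is only that strides never enter the key, while a single flag records that at least one tensor is non-packed.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical tensor description (not MIOpen's TensorDescriptor).
struct TensorDesc
{
    std::vector<std::size_t> lens;    // dimension lengths, outermost first
    std::vector<std::size_t> strides; // element strides, same order as lens
};

// Assumption: a tensor is "packed" when its strides equal the dense strides
// implied by its lengths, with the last dimension being the innermost one.
bool IsPacked(const TensorDesc& t)
{
    assert(t.lens.size() == t.strides.size());
    std::size_t expected = 1;
    for(std::size_t i = t.lens.size(); i-- > 0;)
    {
        if(t.strides[i] != expected)
            return false;
        expected *= t.lens[i];
    }
    return true;
}

// Builds a database key from the lengths plus one optional flag.
// Strides are deliberately left out of the key.
std::string MakeDbKey(const std::vector<TensorDesc>& tensors)
{
    std::string key;
    bool any_non_packed = false;
    for(const auto& t : tensors)
    {
        for(std::size_t len : t.lens)
            key += std::to_string(len) + "x";
        key += "-";
        any_non_packed = any_non_packed || !IsPacked(t);
    }
    if(any_non_packed)
        key += "nonpacked"; // the only trace of the layout in the key
    return key;
}
```

With this sketch, two configurations with the same lengths but different non-packed strides produce the same key, so they would resolve to the same database record, while the packed configuration gets a different key.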

@amberhassaan @junliume

_Originally posted by @atamazov in https://github.com/ROCmSoftwarePlatform/MIOpen/pull/2334#discussion_r1348090910_

junliume commented 11 months ago

@atamazov to add more necessary details here

atamazov commented 11 months ago

Unresolved review comments:

@junliume Thanks for opening the issue. I recommend cleaning up the topmost comment (the issue description), as it is now obsolete.