Closed TApplencourt closed 3 years ago
Added in version 0.32.0
Thanks a lot! My cluster is on maintenance, so I can't try it yet :( Maybe a stupid remark but did you forget to update the computation of "Total number of HW thread" (https://github.com/intel/pti-gpu/blob/360b835279fdb3f279221dc0143ef2593e9e3641/samples/cl_gemm_inst/main.cc#L192) with the DSS count?
Thanks Thomas! Indeed it's a bug, fixed in newest version.
Hi Anton,
Maybe you already know, but IGC just added support for
intel_get_dual_subslice_id
(https://github.com/intel/intel-graphics-compiler/commit/cc63edd997606f0b32c5f1648d047e1461821bba). You may want to updatecl_gemm_inst
and co at some point.If you are short on time, I can submit a PR.
Thanks again for this repo!
Regards, Thomas