linehill closed 3 months ago
I see some failures that seem unrelated to this PR, let me investigate.
Rebased now.
Only this is failing now
dgpu_opencl_make_check_result.txt: FAIL
358 - Unit_hipClassKernel_Friend (SEGFAULT)
This is fixed in #791 so let's merge that first
conflicts
Rebased. This should also resolve the compiler error seen in the CI run for the merge prior to this rebase.
Calling clSetKernelExecInfo() to pass CL_KERNEL_EXEC_INFO_SVM_PTRS incurs notable overhead on PVC. For example, HeCBench/hybridsort-hip sees a 4-16% slowdown.
This patch adds a rudimentary analysis for detecting whether device modules might have indirect buffer accesses. The OpenCL backend skips clSetKernelExecInfo() calls when it knows that the module the kernel belongs to does not have any indirect accesses.
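
For readers skimming the thread, the skip logic described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `ModuleInfo`, `MayHaveIndirectAccesses`, `SetSvmPtrsFn` and `maybeSetSvmPtrs` are hypothetical names, and the callback stands in for the real `clSetKernelExecInfo()` call so the sketch compiles without OpenCL headers.

```cpp
#include <cstddef>
#include <functional>
#include <vector>

// Hypothetical per-module result of the indirect-access analysis.
// The default is conservative: assume indirect accesses unless the
// analysis proved there are none.
struct ModuleInfo {
  bool MayHaveIndirectAccesses = true;
};

// Stand-in (assumption) for the real OpenCL call, which would look like:
//   clSetKernelExecInfo(Kernel, CL_KERNEL_EXEC_INFO_SVM_PTRS,
//                       SizeInBytes, Ptrs);
using SetSvmPtrsFn = std::function<int(void *const *, std::size_t)>;

// Issues the exec-info call only when the kernel's module might access
// buffers indirectly. Returns true if the call was issued.
inline bool maybeSetSvmPtrs(const ModuleInfo &Module,
                            const std::vector<void *> &SvmPtrs,
                            const SetSvmPtrsFn &SetSvmPtrs) {
  if (!Module.MayHaveIndirectAccesses)
    return false; // Skip the call and its overhead.
  SetSvmPtrs(SvmPtrs.data(), SvmPtrs.size() * sizeof(void *));
  return true;
}
```

The flag defaults to `true` so that a module the analysis could not reason about still gets the exec-info call; only a positive "no indirect accesses" result enables the fast path.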