Closed Ashutosh-Londhe closed 1 month ago
Recent changes to Images mean that the CUDA backend requires the NVIDIA runtime toolkit, version >= 11.5. I am in the middle of updating the gettingStartedGuide.md to reflect this. If you use a toolkit version >= 11.5, this should resolve your issue.
Hi Jack,
I tried building with CUDA toolkit 12.0 and was able to build it. But when I tried building and running the test application mentioned in the Getting Started guide, I was able to build it but got the following errors:
`clang++ -fsycl simple-sycl-app.cpp -o simple-sycl-app.exe
./simple-sycl-app.exe
terminate called after throwing an instance of 'sycl::_V1::runtime_error'
what(): No device of requested type available. -1 (PI_ERROR_DEVICE_NOT_FOUND)
Aborted
`
`clang++ -fsycl -fsycl-targets=nvptx64-nvidia-cuda simple-sycl-app.cpp -o simple-sycl-app-cuda.exe --cuda-path=/opt/nvidia/hpc_sdk/Linux_x86_64/23.1/cuda/12.0
ONEAPI_DEVICE_SELECTOR=cuda:* ./simple-sycl-app-cuda.exe
terminate called after throwing an instance of 'sycl::_V1::runtime_error'
what(): No device of requested type available. -1 (PI_ERROR_DEVICE_NOT_FOUND)
Aborted
`
Without CUDA, the first one should run on the CPU, right?
Are you getting sensible results from
$ nvidia-smi.exe
? (if nvidia-smi.exe is not found in your path then you'll have to find where it is located - it is installed somewhere by the cuda toolkit)
Yes Jack
here is the output from Nvidia-smi
`nvidia-smi
Fri Aug 4 13:08:40 2023
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 460.73.01    Driver Version: 460.73.01    CUDA Version: 11.2     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|                               |                      |               MIG M. |
|===============================+======================+======================|
|   0  Tesla V100-PCIE...  On   | 00000000:86:00.0 Off |                    0 |
| N/A   30C    P0    24W / 250W |      0MiB / 16160MiB |      0%      Default |
|                               |                      |                  N/A |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes:                                                                  |
|  GPU   GI   CI        PID   Type   Process name                  GPU Memory |
|        ID   ID                                                   Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+`
This looks problematic. It states that the cuda version of the installed driver and toolkit that is in your path is 11.2, but you have apparently used 12.0 to build intel/llvm. I think you need to make sure you are using 12.0 and a compatible driver. I think you may have to uninstall the existing driver that is apparently for toolkit 11.2. Then install an appropriate driver for 12.0 and your system: https://www.nvidia.com/Download/index.aspx?lang=en-us.
I have been able to build and run simple-sycl-app.cpp fine myself using the build/run instructions you have used.
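The version comparison discussed above can be scripted. The following is a minimal sketch, not part of the original thread: it assumes the `CUDA Version` field in the `nvidia-smi` header reflects the newest CUDA release the installed driver supports, and the function names (`parse_smi_header`, `driver_covers_toolkit`) are hypothetical helpers introduced for illustration. A `False` result is only a hint that the driver may be too old for the toolkit, not proof.

```python
import re


def parse_smi_header(text):
    """Return (driver_version, cuda_version) found in nvidia-smi output, or None."""
    match = re.search(
        r"Driver Version:\s*([\d.]+)\s+CUDA Version:\s*([\d.]+)", text
    )
    return match.groups() if match else None


def version_tuple(version):
    """Turn '11.2' into (11, 2) so versions compare numerically."""
    return tuple(int(part) for part in version.split("."))


def driver_covers_toolkit(smi_text, toolkit_version):
    """True if the CUDA version reported by nvidia-smi is >= the toolkit version.

    Returns None when the header cannot be found in smi_text.
    """
    header = parse_smi_header(smi_text)
    if header is None:
        return None
    _, cuda_version = header
    return version_tuple(cuda_version) >= version_tuple(toolkit_version)


# Header line taken from the nvidia-smi output posted in this thread.
sample = "| NVIDIA-SMI 460.73.01 Driver Version: 460.73.01 CUDA Version: 11.2 |"
print(parse_smi_header(sample))
print(driver_covers_toolkit(sample, "12.0"))
```

In practice the text would come from running `nvidia-smi` and capturing its standard output rather than from a pasted string.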
Thanks Jack, I will do the needful.
Hi Jack, do you mean I should update the CUDA driver for the GPU to 12.0?
I've been thinking about this. In many cases a newer toolkit certainly works with an earlier driver version. It may be that in this case you do have to update the driver version, but to be honest I'm not certain that is your issue. Perhaps there is some other issue with your build.
It would help if you could tell me the complete steps you took to install DPC++.
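One way to probe whether an older driver is new enough for a given build is to check whether the driver library exports a particular entry point. This is a rough sketch under stated assumptions: it assumes a Linux system where the driver library is installed under its usual soname `libcuda.so.1`, and `library_exports` is a hypothetical helper, not an API of DPC++ or CUDA. The symbol `cuDeviceGetUuid_v2` is the one named in the trace posted later in this thread; `cuInit` is included as a baseline that any working driver library should export.

```python
import ctypes


def library_exports(library, symbol):
    """Check whether a shared library exports a symbol.

    Returns True or False when the library loads, and None when the
    library itself cannot be loaded (for example, it is not installed).
    """
    try:
        handle = ctypes.CDLL(library)
    except OSError:
        return None
    # Attribute lookup on a CDLL object resolves the symbol and raises
    # AttributeError when it is missing, so hasattr gives a clean yes/no.
    return hasattr(handle, symbol)


for name in ("cuInit", "cuDeviceGetUuid_v2"):
    print(name, library_exports("libcuda.so.1", name))
```

If the second line prints `False` while the first prints `True`, the installed driver library loads but lacks that entry point, which would point towards the driver rather than the build.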
Can you run
echo $LD_LIBRARY_PATH
and also run
SYCL_PI_TRACE=-1 ./simple-sycl-app.exe
and post the results?
Hopefully this will provide more information. Thanks.
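A full trace is long, so it can help to pull out only the plugin load failures first. The sketch below is an illustration, not part of the DPC++ tooling: it assumes failure lines have the shape `dlopen(<path>) failed with <<path>: <reason>>`, as in the trace posted in this thread, and `plugin_load_failures` and `classify` are hypothetical helper names. The category labels are my own shorthand.

```python
import re

# Matches e.g. "dlopen(/x/libpi_cuda.so) failed with </x/libpi_cuda.so: reason>"
FAILURE = re.compile(r"dlopen\((?P<path>[^)]+)\) failed with <(?P<reason>.*)>\s*$")


def plugin_load_failures(trace_text):
    """Map plugin file name -> failure reason for every failed dlopen line."""
    failures = {}
    for line in trace_text.splitlines():
        match = FAILURE.search(line)
        if not match:
            continue
        path, reason = match.group("path"), match.group("reason")
        # The reason repeats the library path before the message; drop it.
        if reason.startswith(path + ": "):
            reason = reason[len(path) + 2:]
        failures[path.rsplit("/", 1)[-1]] = reason
    return failures


def classify(reason):
    """Rough category: the file is absent, or it loaded but a symbol is unresolved."""
    if reason.startswith("undefined symbol"):
        return "unresolved symbol"
    if "No such file or directory" in reason:
        return "file missing"
    return "other"


LIB = "/ext-home/asl/install/sycl/llvm/build/lib"
sample = "\n".join([
    "SYCL_PI_TRACE[basic]: Plugin found and successfully loaded: libpi_opencl.so",
    f"SYCL_PI_TRACE[-1]: dlopen({LIB}/libpi_cuda.so) failed with "
    f"<{LIB}/libpi_cuda.so: undefined symbol: cuDeviceGetUuid_v2>",
    f"SYCL_PI_TRACE[-1]: dlopen({LIB}/libpi_hip.so) failed with "
    f"<{LIB}/libpi_hip.so: cannot open shared object file: No such file or directory>",
])

for plugin, reason in plugin_load_failures(sample).items():
    print(plugin, "->", classify(reason), "|", reason)
```

An "unresolved symbol" result means the plugin file exists but something it links against does not provide that symbol, while "file missing" usually just means that backend was not built.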
`echo $LD_LIBRARY_PATH
/ext-home/asl/install/sycl/llvm/build/lib:
asl@telos:~/install/sycl$ SYCL_PI_TRACE=-1 ./simple-sycl-app.exe
SYCL_PI_TRACE[basic]: Plugin found and successfully loaded: libpi_opencl.so [ PluginVersion: 14.37.1 ]
SYCL_PI_TRACE[basic]: Plugin found and successfully loaded: libpi_level_zero.so [ PluginVersion: 14.37.1 ]
SYCL_PI_TRACE[-1]: dlopen(/ext-home/asl/install/sycl/llvm/build/lib/libpi_cuda.so) failed with </ext-home/asl/install/sycl/llvm/build/lib/libpi_cuda.so: undefined symbol: cuDeviceGetUuid_v2>
SYCL_PI_TRACE[all]: Check if plugin is present. Failed to load plugin: libpi_cuda.so
SYCL_PI_TRACE[-1]: dlopen(/ext-home/asl/install/sycl/llvm/build/lib/libpi_hip.so) failed with </ext-home/asl/install/sycl/llvm/build/lib/libpi_hip.so: cannot open shared object file: No such file or directory>
SYCL_PI_TRACE[all]: Check if plugin is present. Failed to load plugin: libpi_hip.so
SYCL_PI_TRACE[basic]: Plugin found and successfully loaded: libpi_unified_runtime.so [ PluginVersion: 14.37.1 ]
SYCL_PI_TRACE[-1]: dlopen(/ext-home/asl/install/sycl/llvm/build/lib/libpi_native_cpu.so) failed with </ext-home/asl/install/sycl/llvm/build/lib/libpi_native_cpu.so: cannot open shared object file: No such file or directory>
SYCL_PI_TRACE[all]: Check if plugin is present. Failed to load plugin: libpi_native_cpu.so
---> piPlatformsGet(
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b7412c
) ---> pi_result : PI_SUCCESS
---> piPlatformsGet(
<unknown> : 2
<unknown> : 0xb02de0
<nullptr>
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0xb02de0[ 0xb00d58 ... ]
---> piPlatformGetInfo(
pi_platform : 0xb00d58
<unknown> : 135168
<unknown> : 4
<unknown> : 0x7ffd02b7408c
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piPlatformGetInfo(
pi_platform : 0xb00d58
<unknown> : 2306
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b73fe8
) ---> pi_result : PI_SUCCESS
---> piPlatformGetInfo(
pi_platform : 0xb00d58
<unknown> : 2306
<unknown> : 16
<char * > : 0xb31310
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piPlatformGetInfo(
pi_platform : 0xb00d58
<unknown> : 2306
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b73fe8
) ---> pi_result : PI_SUCCESS
---> piPlatformGetInfo(
pi_platform : 0xb00d58
<unknown> : 2306
<unknown> : 16
<char * > : 0xb31310
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piDevicesGet(
pi_platform : 0xb00d58
<unknown> : 4294967295
<unknown> : 0
<unknown> : 0
<unknown> : 0x7ffd02b73f9c
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0[ nullptr ]
---> piDevicesGet(
pi_platform : 0xb00d58
<unknown> : 4294967295
<unknown> : 1
<unknown> : 0xb02c50
<nullptr>
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0xb02c50[ 0xaef578 ... ]
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4096
<unknown> : 8
<unknown> : 0xb16448
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4162
<unknown> : 8
<unknown> : 0xb16450
<nullptr>
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0xb16450[ 0 ... ]
---> piDeviceRetain(
<unknown> : 0xaef578
) ---> pi_result : PI_SUCCESS
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4144
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b73cb8
) ---> pi_result : PI_SUCCESS
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4144
<unknown> : 623
<char * > : 0xb2fd90
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piDeviceRelease(
<unknown> : 0xaef578
) ---> pi_result : PI_SUCCESS
---> piDeviceRelease(
<unknown> : 0xaef578
) ---> pi_result : PI_SUCCESS
---> piPlatformGetInfo(
pi_platform : 0xb1fd30
<unknown> : 135168
<unknown> : 4
<unknown> : 0x7ffd02b7408c
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piPlatformGetInfo(
pi_platform : 0xb1fd30
<unknown> : 2306
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b73fe8
) ---> pi_result : PI_SUCCESS
---> piPlatformGetInfo(
pi_platform : 0xb1fd30
<unknown> : 2306
<unknown> : 12
<char * > : 0xb1feb0
<nullptr>
) ---> pi_result : PI_SUCCESS
SYCL_PI_TRACE[all]: NVIDIA CUDA OpenCL platform found but is not compatible.
---> piPlatformsGet(
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b7412c
) ---> pi_result : PI_SUCCESS
---> piDevicesGet(
pi_platform : 0xb00d58
<unknown> : 4294967295
<unknown> : 0
<unknown> : 0
<unknown> : 0x7ffd02b741ec
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0[ nullptr ]
---> piDevicesGet(
pi_platform : 0xb00d58
<unknown> : 4294967295
<unknown> : 1
<unknown> : 0xb1fe90
<nullptr>
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0xb1fe90[ 0xaef578 ... ]
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4096
<unknown> : 8
<unknown> : 0xb16098
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4162
<unknown> : 8
<unknown> : 0xb160a0
<nullptr>
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0xb160a0[ 0 ... ]
---> piDeviceRetain(
<unknown> : 0xaef578
) ---> pi_result : PI_SUCCESS
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4144
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b73f08
) ---> pi_result : PI_SUCCESS
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4144
<unknown> : 623
<char * > : 0xb30010
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piDeviceRelease(
<unknown> : 0xaef578
) ---> pi_result : PI_SUCCESS
SYCL_PI_TRACE[all]: Requested device_type: info::device_type::automatic
---> piextDeviceSelectBinary(
<unknown> : 0xaef578
<unknown> : 0x7ffd02b74138
<unknown> : 1
<unknown> : 0x7ffd02b74134
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0x7ffd02b74138[ 0x4129d0 ... ]
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4145
<unknown> : 8
<unknown> : 0x7ffd02b74188
<nullptr>
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0x7ffd02b74188[ 0xb00d58 ... ]
---> piPlatformGetInfo(
pi_platform : 0xb00d58
<unknown> : 2306
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b741a8
) ---> pi_result : PI_SUCCESS
---> piPlatformGetInfo(
pi_platform : 0xb00d58
<unknown> : 2306
<unknown> : 16
<char * > : 0xb1feb0
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4139
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b74198
) ---> pi_result : PI_SUCCESS
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4139
<unknown> : 41
<char * > : 0xb25b90
<nullptr>
) ---> pi_result : PI_SUCCESS
SYCL_PI_TRACE[all]: Candidate device: -> score = 1300
SYCL_PI_TRACE[all]: platform: Intel(R) OpenCL
SYCL_PI_TRACE[all]: device: Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4145
<unknown> : 8
<unknown> : 0x7ffd02b74188
<nullptr>
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0x7ffd02b74188[ 0xb00d58 ... ]
---> piPlatformGetInfo(
pi_platform : 0xb00d58
<unknown> : 2306
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b741a8
) ---> pi_result : PI_SUCCESS
---> piPlatformGetInfo(
pi_platform : 0xb00d58
<unknown> : 2306
<unknown> : 16
<char * > : 0xb1feb0
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4139
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b74198
) ---> pi_result : PI_SUCCESS
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4139
<unknown> : 41
<char * > : 0xb14820
<nullptr>
) ---> pi_result : PI_SUCCESS
SYCL_PI_TRACE[all]: Selected device: -> final score = 1300
SYCL_PI_TRACE[all]: platform: Intel(R) OpenCL
SYCL_PI_TRACE[all]: device: Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz
---> piDevicesGet(
pi_platform : 0xb00d58
<unknown> : 4294967295
<unknown> : 0
<unknown> : 0
<unknown> : 0x7ffd02b740ec
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0[ nullptr ]
---> piDevicesGet(
pi_platform : 0xb00d58
<unknown> : 4294967295
<unknown> : 1
<unknown> : 0xb1feb0
<nullptr>
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0xb1feb0[ 0xaef578 ... ]
---> piDeviceRelease(
<unknown> : 0xaef578
) ---> pi_result : PI_SUCCESS
---> piContextCreate(
<nullptr>
<unknown> : 1
<unknown> : 0xb26780
<nullptr>
<nullptr>
<unknown> : 0xb30378
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0xb26780[ 0xaef578 ... ]
[out]<unknown> ** : 0xb30378[ 0xb30678 ... ]
---> piextQueueCreate(
<unknown> : 0xb30678
<unknown> : 0xaef578
<unknown> : 0x7ffd02b740e0
<unknown> : 0x7ffd02b740b8
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0x7ffd02b740b8[ 0xb162f8 ... ]
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4149
<unknown> : 4
<unknown> : 0x7ffd02b72c3c
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piMemBufferCreate(
<unknown> : 0xb30678
<unknown> : 1
<unknown> : 32
<unknown> : 0
<unknown> : 0x7ffd02b72d10
<unknown> : 0
) ---> pi_result : PI_SUCCESS
[out]void * : 0
[out]pi_mem * : 0x7ffd02b72d10[ 0x2087878 ... ]
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 65576
<unknown> : 4
<unknown> : 0x7ffd02b725bc
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piextDeviceSelectBinary(
<unknown> : 0xaef578
<unknown> : 0x2088050
<unknown> : 1
<unknown> : 0x7ffd02b724dc
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0x2088050[ 0x412910 ... ]
---> piPluginGetBackendOption(
pi_platform : 0xb00d58
<const char *>: -O2
<unknown> : 0x7ffd02b72350
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0x7ffd02b72350[ ... ]
---> piDeviceGetInfo(
<unknown> : 0xaef578
<unknown> : 4136
<unknown> : 4
<unknown> : 0x7ffd02b7236c
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piPlatformGetInfo(
pi_platform : 0xb00d58
<unknown> : 2305
<unknown> : 0
<nullptr>
<unknown> : 0x7ffd02b722e8
) ---> pi_result : PI_SUCCESS
---> piPlatformGetInfo(
pi_platform : 0xb00d58
<unknown> : 2305
<unknown> : 17
<char * > : 0x2088070
<nullptr>
) ---> pi_result : PI_SUCCESS
---> piProgramCreate(
<unknown> : 0xb30678
<unknown> : 0x412a60
<unknown> : 5552
<unknown> : 0x7ffd02b72368
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0x7ffd02b72368[ 0x20881c8 ... ]
---> piProgramBuild(
<unknown> : 0x20881c8
<unknown> : 1
<unknown> : 0xb16090
<const char *>:
<nullptr>
<nullptr>
InvalidModule: Invalid SPIR-V module: input SPIR-V module uses unknown extension 'SPV_KHR_expect_assume'`
---> piProgramCreate(
<unknown> : 0xb30678
<unknown> : 0x412a60
<unknown> : 5552
<unknown> : 0x7ffd02b72368
) ---> pi_result : PI_SUCCESS
[out]<unknown> ** : 0x7ffd02b72368[ 0x20881c8 ... ]
---> piProgramBuild(
<unknown> : 0x20881c8
<unknown> : 1
<unknown> : 0xb16090
<const char *>:
<nullptr>
<nullptr>
InvalidModule: Invalid SPIR-V module: input SPIR-V module uses unknown extension 'SPV_KHR_expect_assume'`
This is what I get when I don't add the CUDA libs to the path.
If I add the CUDA libs to the path, then:
echo $LD_LIBRARY_PATH
/opt/nvidia/hpc_sdk/Linux_x86_64/23.1/comm_libs/nvshmem/lib:/opt/nvidia/hpc_sdk/Linux_x86_64/23.1/comm_libs/nccl/lib:/opt/nvidia/hpc_sdk/Linux_x86_64/23.1/comm_libs/mpi/lib:/opt/nvidia/hpc_sdk/Linux_x86_64/23.1/math_libs/lib64:/opt/nvidia/hpc_sdk/Linux_x86_64/23.1/compilers/lib:/opt/nvidia/hpc_sdk/Linux_x86_64/23.1/compilers/extras/qd/lib:/opt/nvidia/hpc_sdk/Linux_x86_64/23.1/cuda/extras/CUPTI/lib64:/opt/nvidia/hpc_sdk/Linux_x86_64/23.1/cuda/lib64:/ext-home/asl/install/sycl/llvm/build/lib:
asl@telos:~/install/sycl$ SYCL_PI_TRACE=-1 ./simple-sycl-app.exe
SYCL_PI_TRACE[-1]: dlopen(/ext-home/asl/install/sycl/llvm/build/lib/libpi_opencl.so) failed with </opt/nvidia/hpc_sdk/Linux_x86_64/23.1/cuda/lib64/libOpenCL.so.1: version `OPENCL_2.1' not found (required by /ext-home/asl/install/sycl/llvm/build/lib/libpi_opencl.so)>
SYCL_PI_TRACE[all]: Check if plugin is present. Failed to load plugin: libpi_opencl.so
SYCL_PI_TRACE[basic]: Plugin found and successfully loaded: libpi_level_zero.so [ PluginVersion: 14.37.1 ]
SYCL_PI_TRACE[-1]: dlopen(/ext-home/asl/install/sycl/llvm/build/lib/libpi_cuda.so) failed with </ext-home/asl/install/sycl/llvm/build/lib/libpi_cuda.so: undefined symbol: cuDeviceGetUuid_v2>
SYCL_PI_TRACE[all]: Check if plugin is present. Failed to load plugin: libpi_cuda.so
SYCL_PI_TRACE[-1]: dlopen(/ext-home/asl/install/sycl/llvm/build/lib/libpi_hip.so) failed with </ext-home/asl/install/sycl/llvm/build/lib/libpi_hip.so: cannot open shared object file: No such file or directory>
SYCL_PI_TRACE[all]: Check if plugin is present. Failed to load plugin: libpi_hip.so
SYCL_PI_TRACE[basic]: Plugin found and successfully loaded: libpi_unified_runtime.so [ PluginVersion: 14.37.1 ]
SYCL_PI_TRACE[-1]: dlopen(/ext-home/asl/install/sycl/llvm/build/lib/libpi_native_cpu.so) failed with </ext-home/asl/install/sycl/llvm/build/lib/libpi_native_cpu.so: cannot open shared object file: No such file or directory>
SYCL_PI_TRACE[all]: Check if plugin is present. Failed to load plugin: libpi_native_cpu.so
---> piPlatformsGet(
I see. Are you using the intel/llvm sycl branch? Which commit is your checked-out intel/llvm sycl branch at?
When you updated the CUDA toolkit, did you rebuild intel/llvm from scratch by deleting the existing intel/llvm/build folder?
Yes Jack, I am using the sycl branch.
The latest commit is
`commit ec6ea1ca8b784601d21e2c849733389412bab999 (HEAD -> sycl, origin/sycl, origin/HEAD)
Author: Vyacheslav Klochkov <vyacheslav.n.klochkov@intel.com>
Date: Mon Jul 31 16:17:25 2023 -0500
[ESIMD][E2E] Fix flaky-ness of slm_gather_scatter.cpp test (#10629)
Signed-off-by: Vyacheslav N Klochkov <vyacheslav.n.klochkov@intel.com>`
And yes, I deleted the llvm/build folder before starting configure.
Ah, try moving all libOpenCL*.so libs out of the lib64 folder of your CUDA 12.0 toolkit install. Also make sure that this lib64 folder is in your LD_LIBRARY_PATH (this is a standard step in CUDA toolkit installation).
This may be an incompatibility between your DPC++ OpenCL installation and the CUDA OpenCL libs.
I am actually working on a cluster without sudo access, so unfortunately I cannot move the libOpenCL*.so libs out of the lib64 folder.
You can try a special ordering: add the path containing the correct OpenCL.so libs to LD_LIBRARY_PATH after adding the CUDA lib64 path to LD_LIBRARY_PATH, so that it ends up earlier in the search order (or at least make sure that the correct OpenCL.so libs are found first).
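The ordering idea above can be sketched in shell. This is only an illustration: `prepend_lib_path` is a hypothetical helper and both directories are placeholders, not paths taken from this thread. It relies on the dynamic loader trying the entries of LD_LIBRARY_PATH from left to right, so the directory that should win has to come first.

```shell
# Sketch only: build a library search path in which the directory holding the
# wanted OpenCL libs comes before the CUDA lib64 directory.
# prepend_lib_path is a hypothetical helper; both paths are placeholders.

prepend_lib_path() {
    # Print a search path with directory $1 placed in front of list $2.
    if [ -n "$2" ]; then
        printf '%s:%s\n' "$1" "$2"
    else
        printf '%s\n' "$1"
    fi
}

CUDA_LIB64=/path/to/cuda/lib64            # placeholder
GOOD_OPENCL_DIR=/path/to/good/opencl      # placeholder

# Add the CUDA directory first, then prepend the OpenCL directory so that
# it is searched before the CUDA one.
search_path=$(prepend_lib_path "$CUDA_LIB64" "")
search_path=$(prepend_lib_path "$GOOD_OPENCL_DIR" "$search_path")

echo "$search_path"
# To apply it: export LD_LIBRARY_PATH="$search_path"
```

To confirm which library the loader actually picks up, running `ldd` on libpi_opencl.so and looking at the resolved libOpenCL entry should show whether the ordering took effect.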
It looks like you built DPC++ with a CUDA version newer than your driver version. See here https://github.com/intel/llvm/blob/sycl/sycl/plugins/unified_runtime/ur/adapters/cuda/device.cpp#L983C29-L983C29
Since CUDA toolkits are not compatible with the driver across major versions, you will need to build with a CUDA 11 toolkit if you want to keep your driver at CUDA 11. The issue that you have with the missing CU_AD_* is a bug which is hopefully fixed here https://github.com/intel/llvm/pull/10691
Can you check out the branch in the PR and see if you can successfully build DPC++ with CUDA 11.x?
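As a rough sketch of the version check implied above: compare the major version of the toolkit used for the build with the CUDA version that nvidia-smi reports for the driver. The helper names `cuda_major` and `check_toolkit_vs_driver` are hypothetical, and the rule is a simplification that only looks at major versions. The values 12.0 and 11.2 are the ones reported earlier in this thread; 11.8 is just an example of a CUDA 11.x toolkit.

```shell
# Sketch only: flag a build toolkit whose CUDA major version is newer than
# the CUDA major version supported by the installed driver.
# Both function names are hypothetical helpers, not part of any CUDA tool.

cuda_major() {
    # "12.0" -> "12": strip everything from the first dot onwards.
    printf '%s\n' "${1%%.*}"
}

check_toolkit_vs_driver() {
    # $1: toolkit version, $2: CUDA version reported by the driver.
    if [ "$(cuda_major "$1")" -gt "$(cuda_major "$2")" ]; then
        echo "mismatch"
    else
        echo "ok"
    fi
}

# Toolkit 12.0 against a driver reporting CUDA 11.2, as in this thread.
check_toolkit_vs_driver 12.0 11.2   # prints: mismatch

# An example CUDA 11.x toolkit against the same driver.
check_toolkit_vs_driver 11.8 11.2   # prints: ok
```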
@hdelan There are still a couple left that didn't get fixed with the above PR: https://github.com/hdelan/llvm/blob/sycl/sycl/plugins/unified_runtime/ur/adapters/cuda/image.cpp#L127-L130
Thanks @abagusetty, here's the new patch: https://github.com/intel/llvm/pull/10922
Closed by https://github.com/intel/llvm/pull/10922
Describe the bug
I was getting an error while building SYCL with NVIDIA CUDA.
I followed the procedure described in the getting started guide,
but when I ran the compile.py file
I got the following error.
In my case, CUDA is not installed in a standard path, so, as mentioned in the getting started guide, I didn't add it to PATH and LD_LIBRARY_PATH