Open zjin-lcf opened 1 year ago
This would require an extension for a device property that can query for CU_DEVICE_ATTRIBUTE_MAX_THREADS_PER_MULTIPROCESSOR in pi_cuda.cpp
The device property may be max work items per compute unit. This property may be supported by Intel GPUs too.
https://github.com/oneapi-src/SYCLomatic/issues/496
The property is commonly used in GPU programs written by researchers https://userweb.cs.txstate.edu/~burtscher/publications.html
@dm-vodopyanov Did you have any updates about this ?
Please see the code sample in CUDA. Users are not sure if they could compute the value in SYCL using an alternative way. Thanks.