Open sergisiso opened 3 years ago
When the checksum is 0 it could be because it is using OpenCL 1.2 and the global_sizse is not a divisible exactly by the number of work sizes, which is a requirement no longer necessary in OpenCL > 2.0 .
Since OpenCL 1.2 is quite old and the issue can be easily resolve by executing the application with DL_ESM_ALIGNMENT=X to add enough elements to match the requirement I propose to leave it as it is. But it would be good to mention the issue/solution in the README of the relevant folder.
The NemoLite2D Fortran OpenCL manual implementation sometimes produce 0 checksum values. (this may be related to the invalid memory accesses due to sometimes accessing out of boundary values)
Also the OpenCL device is different, this has been observed in POCL and the AMD GPUs.