OpenCL on macOS (M2 Max chip) seems to fail.
My best guess is that this is due to the number of OpenCL function variants defined, since every function has one variant each for `int8_t`, `int16_t`, `int32_t`, etc.
Solution?
The best idea I've got to fix this is to generate and compile the kernels at runtime. This way, only the kernels that the program actually uses are produced.
Issues:
- Requires writing the code to generate every type of kernel
- Requires runtime compilation of kernels (not a pre-main process)
- Might not fix the issue
I'll probably get `test` merged through to master in its current state (as of writing this issue) before working on this.
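For reference, the runtime-generation idea could look roughly like the sketch below. It only covers the "generate on first use and cache" part: the names `makeAddKernelSource` and `KernelSourceCache` are hypothetical and not part of the existing code, and the element-wise add kernel is just a stand-in for whichever functions actually need variants. The example uses OpenCL C type names such as `int` and `float`, since OpenCL C spells its scalar types `char`, `short`, `int`, `long` rather than `int8_t` and friends.

```cpp
#include <cstddef>
#include <string>
#include <unordered_map>

// Hypothetical helper: builds OpenCL C source for one element-wise "add"
// kernel, specialised for a single element type (e.g. "int" or "float").
std::string makeAddKernelSource(const std::string &clType) {
    return "__kernel void add_" + clType + "(__global const " + clType + " *a, "
           "__global const " + clType + " *b, "
           "__global " + clType + " *out) {\n"
           "    size_t i = get_global_id(0);\n"
           "    out[i] = a[i] + b[i];\n"
           "}\n";
}

// Hypothetical cache: a kernel variant is generated the first time it is
// requested, so variants the program never uses are never produced.
class KernelSourceCache {
public:
    // Returns the source for the requested type, generating it on first use.
    const std::string &get(const std::string &clType) {
        auto it = m_sources.find(clType);
        if (it == m_sources.end()) {
            // Assumption: in the real code this is where the source would be
            // passed to clCreateProgramWithSource and clBuildProgram, and the
            // built program would be cached rather than the string.
            it = m_sources.emplace(clType, makeAddKernelSource(clType)).first;
        }
        return it->second;
    }

    // Number of variants generated so far.
    std::size_t size() const { return m_sources.size(); }

private:
    std::unordered_map<std::string, std::string> m_sources;
};
```

With this shape, calling `cache.get("int")` twice generates the `add_int` source once, and a program that never asks for `"float"` never pays for that variant. Whether that actually avoids the failure on the M2 Max is still an open question, as noted above.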