Closed nomaddo closed 6 years ago
You may check this
https://github.com/doe300/VC4C/issues/30
There are issues in the implementation. I tried to help, however my knowledge about VideoCore programming is rather limited.
Can you also re-check with the latest VC4CL?
Thanks. The output became correct.
I am trying to check VC4CL works correctly. In the following code, the output is differenct from what I expected and the one of NVIDIA env.
kernel
host code can found in https://github.com/nomaddo/opencl-benchmark/blob/master/gpu.c . This host code just take kernel file name, function name, num of args, and the length of each argument.. Then execute the kernel, and print all arguments.
The expected result (and the one of NVIDIA GPU) is as follows:
I use the latest of
VC4C
built by circleci, and self-compiledVC4CL
, which also come from latest source-code.