Closed CluelessDuck closed 7 years ago
There is a cast from float2 to float4. This cast is not used for every FFT length, so some lengths pass and some fail. It seems that the ARM runtime does not support this cast.
I don't see a cast? The code seems to be simply trying to assign a float2 to a float4. It should complain about that without an explicit cast.
This was not a problem with other OpenCL compilers, which just raise a warning; it looks like in your case it fails with an error. I have checked in a change with an explicit cast in the develop branch (0f8fe79). Let me know if that fixes the issue for you.
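For readers following along, here is a minimal sketch of what "explicit" means in this context. It is plain host-side C, not the clFFT kernel code and not the contents of commit 0f8fe79: `float2_t`, `float4_t`, and `widen_float2` are hypothetical stand-ins for the OpenCL C vector types, and zero-filling the upper two components is an assumption made only for illustration.

```c
#include <stdio.h>

/* Hypothetical host-side stand-ins for the OpenCL C types float2 and float4. */
typedef struct { float x, y; } float2_t;
typedef struct { float x, y, z, w; } float4_t;

/* Explicit widening: every component of the wider value is spelled out,
 * so nothing depends on the compiler accepting an implicit conversion.
 * Assumption: the two extra components are zero-filled; the real kernel
 * may fill them differently. */
static float4_t widen_float2(float2_t v)
{
    float4_t r = { v.x, v.y, 0.0f, 0.0f };
    return r;
}

/* Small helper that prints the four components of a widened value. */
static void print_float4(float4_t v)
{
    printf("%.1f %.1f %.1f %.1f\n", v.x, v.y, v.z, v.w);
}
```

In OpenCL C itself, the analogous explicit form would be a vector literal along the lines of `(float4)(v.x, v.y, 0.0f, 0.0f)`, rather than assigning the float2 value directly to a float4 variable.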
Yes, this is not an explicit cast. We do it explicitly in rocFFT.
Thank you! This resolved the issue.
When running clFFT-client on an embedded system (Phytec Phycore-RK3288) with Yocto Linux, the program crashes for certain FFT lengths: 1024, 2048, and 4096. Lengths of 256 and 512 worked without problems, and with an FFT length of 16384 the reported execution wall time and execution gflops were negative. When I used a real input (--inLayout 5) and Hermitian_Interleaved output (--outLayout 3), the error didn't occur.
My own FFT implementation using clFFT led to the same problem when using Complex_Interleaved input and output. With a real input, my implementation did (mostly) work on the system. In my implementation, the error occurs during the execution of clfftBakePlan().
The FFT sizes I wanted to use on the embedded system are exactly those that don't work (2048C and 4096C).
The clinfo output of my system is:
The error message is:
I am sorry if this is not the right place for this bug report or if it is an error on my side.