Can someone please answer this question, have you tried to run the ARM neon code on an iOS device? I am getting very incorrect values with ffts_init_1d_real forward transform starting with any signal size 16 and above. I am testing with i = i + 1 so 1,2,3,4... etc.
At this point I have tried this master branch and a few forks but it seems like no matter what I try there is something wrong with the results on ARM. The same exact code I have works perfectly with SSE but gives completely wrong results on ARM.
Can someone please answer this question, have you tried to run the ARM neon code on an iOS device? I am getting very incorrect values with ffts_init_1d_real forward transform starting with any signal size 16 and above. I am testing with i = i + 1 so 1,2,3,4... etc.
At this point I have tried this master branch and a few forks but it seems like no matter what I try there is something wrong with the results on ARM. The same exact code I have works perfectly with SSE but gives completely wrong results on ARM.