al42and closed this issue 1 year ago
Hello!
Thanks for pointing this out. I was exploring addressing optimisations that could be made for various compilers and overlooked the R2C/C2R case, where they are not applicable. It should be fixed now.
Best regards, Dmitrii
Thanks for the quick fix!
Doing a roundtrip for a 5x5x10 3D R2C transform, the last XY-row is not reconstructed correctly:
All other values are within 1e-5 absolute error.
Code: https://gist.github.com/al42and/c5b1cf3afe261585102971579c851e42
Tested with: ROCm 5.3.3 on MI250X (gfx90a), ROCm 5.4.1 on MI50 (gfx906), ROCm 5.4.2 on RX 6400 (gfx1034).
The same code works well with the master version of VkFFT (13005671b20956983128003d3747b0529f4ded9a). Bisection leads to 8ec6867504f1e9a3e87db58f2d0c6bc512ad11fc.
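For anyone who wants to reproduce the kind of check described above without a GPU, here is a minimal reference sketch in pure Python. It is not the code from the gist and does not call VkFFT. It uses a naive DFT and assumes the 10-element axis is the one halved by R2C. It also assumes "XY-row" means a slice at a fixed index along that axis. All function names (`r2c_1d`, `c2r_1d`, `apply_along`, `roundtrip`, `bad_planes`) are hypothetical helpers made up for this illustration.

```python
import cmath
import random


def dft(seq, sign):
    """Naive O(n^2) unnormalised DFT; sign=-1 is forward, sign=+1 is inverse."""
    n = len(seq)
    return [sum(seq[j] * cmath.exp(sign * 2j * cmath.pi * j * k / n) for j in range(n))
            for k in range(n)]


def r2c_1d(real_seq):
    """Forward transform of a real line, keeping only the n//2 + 1 unique bins."""
    return dft(real_seq, -1)[: len(real_seq) // 2 + 1]


def c2r_1d(half, n):
    """Rebuild the full spectrum via X[n-k] = conj(X[k]), then invert and normalise."""
    full = list(half) + [half[n - k].conjugate() for k in range(len(half), n)]
    return [v.real / n for v in dft(full, +1)]


def apply_along(data, shape, axis, fn):
    """Apply fn to every 1D line along `axis`; data maps (i, j, k) -> value."""
    other = [a for a in range(3) if a != axis]
    out, new_len = {}, shape[axis]
    for p in range(shape[other[0]]):
        for q in range(shape[other[1]]):
            idx = [0, 0, 0]
            idx[other[0]], idx[other[1]] = p, q
            line = []
            for t in range(shape[axis]):
                idx[axis] = t
                line.append(data[tuple(idx)])
            result = fn(line)
            new_len = len(result)
            for t, v in enumerate(result):
                idx[axis] = t
                out[tuple(idx)] = v
    new_shape = list(shape)
    new_shape[axis] = new_len
    return out, tuple(new_shape)


def roundtrip(data, shape):
    """R2C along the last axis, complex DFT along the others, then the inverse."""
    n_last = shape[2]
    spec, s = apply_along(data, shape, 2, r2c_1d)
    for ax in (1, 0):
        spec, s = apply_along(spec, s, ax, lambda line: dft(line, -1))
    for ax in (0, 1):
        spec, s = apply_along(spec, s, ax,
                              lambda line: [v / len(line) for v in dft(line, +1)])
    restored, _ = apply_along(spec, s, 2, lambda half: c2r_1d(half, n_last))
    return restored


def bad_planes(original, restored, tol=1e-5):
    """Indices along the last axis where any value differs by more than tol."""
    return sorted({key[2] for key, v in original.items()
                   if abs(restored[key] - v) > tol})


shape = (5, 5, 10)
random.seed(0)
data = {(i, j, k): random.uniform(-1.0, 1.0)
        for i in range(shape[0]) for j in range(shape[1]) for k in range(shape[2])}
restored = roundtrip(data, shape)
print("slices exceeding tolerance:", bad_planes(data, restored))
```

With a correct transform the printed list should be empty. To compare against a GPU library, one would replace `roundtrip` with the library's forward and inverse calls and keep the same comparison, so a failure confined to the last slice shows up as a single index in the list.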