Open GoogleCodeExporter opened 8 years ago
Addition to problem (1) above: there is indeed a loss of accuracy with clAmdFft
when its internal tests fail.
In particular, running ADDA with '-grid 10' produces the same result, Cext =
28.10875812, for the sequential version and both OpenCL versions (using Apple
clFFT and clAmdFft), which correlates with the test passing for FFT size 20.
However, for '-grid 16' the results are 135.0449041, 135.044904, and
135.0450408, respectively. This correlates with the test failing for FFT size 32.
This loss of accuracy can become more significant for longer runs (with slower
convergence of the iterative solver).
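To put the '-grid 16' numbers in perspective, here is a minimal Python sketch
(not part of ADDA; the function and variable names are illustrative only) that
computes the relative deviation of each OpenCL result from the sequential one,
using the Cext values quoted above:

```python
def relative_difference(value, reference):
    """Return |value - reference| / |reference|."""
    return abs(value - reference) / abs(reference)


# Cext values for '-grid 16', copied from the runs reported above.
cext_sequential = 135.0449041
cext_opencl = {
    "Apple clFFT": 135.044904,
    "clAmdFft": 135.0450408,
}

# Relative deviation of each OpenCL variant from the sequential result.
rel_diff = {
    name: relative_difference(value, cext_sequential)
    for name, value in cext_opencl.items()
}

for name, diff in rel_diff.items():
    print(f"{name}: relative difference = {diff:.2e}")
```

With these inputs the Apple clFFT value agrees with the sequential one to
within the printed precision, while the clAmdFft value deviates at roughly the
1e-6 level.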
Original comment by yurkin
on 30 Jan 2013 at 6:53
I have just tested clAmdFft 1.10.274 on Windows. There seem to be no changes
in the obtained results or in any of the bugs described above. The
only difference is that the 32-bit build on 64-bit Windows crashes with -grid
16 (while still working fine with -grid 6, 10, and 14).
It would still be interesting to look in detail at whether the new release of
clAmdFft contains any new features. In particular, it seems to contain a flag
for faster (but potentially less accurate) transforms.
Original comment by yurkin
on 15 Apr 2013 at 1:57
Original comment by yurkin
on 3 Aug 2014 at 4:51
Original comment by yurkin
on 3 Aug 2014 at 4:59
Original issue reported on code.google.com by
yurkin
on 20 Jan 2013 at 5:29