Open rainwoodman opened 8 years ago
Sorry, I meant 1024 not 1020.
This is the way FFTW did it for years. But I do not have any problems with your suggestions. I'll give it a try in branch block_offset. It will be merged into master after some tests.
Thanks!
The local_start of an empty rank is always set to 0. This is causes unnecessary branching in downstream code. The logical model is simpler if we just think of these 'stencils' as with a size of zero, but offsetted the same way as others.
For example the local_i_start of a 3d r2c transform on a 2x53 domain decomposition(this set-up is sub-optimal) is currently:
I would suggest to change the last 0 to 1020.