NVIDIA / CUDALibrarySamples

CUDA Library Samples

Correct cufft_utils scaling_kernel indexing #196

Open icyveins7 opened 2 months ago

icyveins7 commented 2 months ago

The scaling_kernel was clearly designed to be a grid-stride kernel to multiply all elements by a constant; this correction makes it so.

The examples currently build and run correctly as-is. However, if the parameters are increased (for example, in 1d_r2c_c2r), the output will very likely be wrong, because only a few elements get 'scaled', and those get scaled multiple times.
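For illustration, here is a minimal sketch of what a grid-stride scaling kernel generally looks like. It is not the diff from this PR or the code in cufft_utils: the names `scale_grid_stride`, `scale_on_device` and `check` are hypothetical, and it assumes plain `float` data rather than the complex types the samples use. The point is that the loop body indexes with the loop variable `i`, and that the start index and stride cover the whole grid, so every element is scaled exactly once.

```cuda
#include <cuda_runtime.h>
#include <cstdio>
#include <cstdlib>
#include <vector>

// Hypothetical helper: abort with a message if a CUDA runtime call failed.
static void check(cudaError_t err) {
    if (err != cudaSuccess) {
        std::fprintf(stderr, "CUDA error: %s\n", cudaGetErrorString(err));
        std::abort();
    }
}

// Hypothetical grid-stride kernel (assumes float data, not cufftComplex).
__global__ void scale_grid_stride(float* data, int element_count, float scale) {
    // Global index of this thread across the whole grid.
    const int first = blockIdx.x * blockDim.x + threadIdx.x;
    // Total number of threads in the grid.
    const int stride = gridDim.x * blockDim.x;
    for (int i = first; i < element_count; i += stride) {
        data[i] *= scale;  // index with i, so each element is touched once
    }
}

// Hypothetical host wrapper: copy in, scale on the device, copy back.
std::vector<float> scale_on_device(const std::vector<float>& host, float scale,
                                   int blocks, int threads) {
    const size_t bytes = host.size() * sizeof(float);
    float* dev = nullptr;
    check(cudaMalloc(&dev, bytes));
    check(cudaMemcpy(dev, host.data(), bytes, cudaMemcpyHostToDevice));

    scale_grid_stride<<<blocks, threads>>>(dev, static_cast<int>(host.size()), scale);
    check(cudaGetLastError());

    std::vector<float> result(host.size());
    // cudaMemcpy on the default stream waits for the kernel to finish.
    check(cudaMemcpy(result.data(), dev, bytes, cudaMemcpyDeviceToHost));
    check(cudaFree(dev));
    return result;
}
```

Calling `scale_on_device` with far more elements than launched threads (for example 1000 elements with 2 blocks of 32 threads) is the case where a kernel that indexes by thread ID instead of by `i` would leave most elements unscaled.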