Closed JDTruj2018 closed 8 months ago
Unnecessary checks on (j < pattern_length) in the CUDA kernels, since j is calculated from j = total_id % pattern_length.
j = total_id % pattern_length
The only check necessary in all CUDA kernels is if (i < count) since i = total_id / pattern_length can be larger than count.
if (i < count)
i = total_id / pattern_length
count
Unnecessary checks on (j < pattern_length) in the CUDA kernels, since j is calculated from
j = total_id % pattern_length
.The only check necessary in all CUDA kernels is
if (i < count)
sincei = total_id / pattern_length
can be larger thancount
.