lacker / seticore

A high-performance implementation of some core SETI algorithms that can be included in other programs.
MIT License
3 stars 6 forks source link

Handle non-power-of-two num_timesteps #4

Closed lacker closed 2 years ago

lacker commented 2 years ago

At least some sample ATA data is 286 x 50331648 . I think we can avoid allocating extra GPU memory and just make the kernel accept a set size which is larger than num_timesteps and treat an out-of-bounds access as being a zero.

lacker commented 2 years ago

seems like it's working now, as of

https://github.com/lacker/seticore/commit/4694d894c5a45768762c327ed21d93c9adfb90b9