eyalroz / cuda-kat

CUDA kernel author's tools
BSD 3-Clause "New" or "Revised" License

append_to_global_memory() for disparate per-thread data? #76

Open eyalroz opened 4 years ago

eyalroz commented 4 years ago

I wonder if we should consider a version of append_to_global_memory() where each thread's data lives elsewhere in memory, i.e. each thread passes the address of its own data. Perhaps we should also consider a version where each thread's data is guaranteed to be in registers (e.g. with a capped common size, so that we could perhaps use a kat::array).
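To make the first variant concrete, here is a rough sketch in plain CUDA of what "each thread has its data at an address" could look like. It is not cuda-kat code: `append_disparate_to_global_memory`, `demo_kernel`, `append_demo` and all parameter names are hypothetical, and the sketch assumes that all 32 lanes of every warp make the call and that the output buffer is large enough. Error checking is omitted for brevity.

```cuda
#include <cuda_runtime.h>
#include <vector>

constexpr unsigned full_warp_mask = 0xFFFFFFFFu;
constexpr unsigned warp_size = 32;

// Hypothetical sketch, not part of cuda-kat. Every lane of a full warp must call it.
// Each lane passes the address and length of its own fragment.
template <typename T>
__device__ void append_disparate_to_global_memory(
    T* global_output, unsigned* global_output_length,
    const T* my_fragment, unsigned my_length)
{
    const unsigned lane = threadIdx.x % warp_size;

    // Inclusive prefix sum of the per-lane fragment lengths across the warp.
    unsigned inclusive = my_length;
    for (unsigned delta = 1; delta < warp_size; delta *= 2) {
        unsigned from_below = __shfl_up_sync(full_warp_mask, inclusive, delta);
        if (lane >= delta) { inclusive += from_below; }
    }
    const unsigned warp_total = __shfl_sync(full_warp_mask, inclusive, warp_size - 1);

    // A single atomic per warp reserves room for all of the warp's fragments.
    unsigned base = 0;
    if (lane == 0) { base = atomicAdd(global_output_length, warp_total); }
    base = __shfl_sync(full_warp_mask, base, 0);

    // Each lane copies its own fragment to its slot in the reserved range.
    T* destination = global_output + base + (inclusive - my_length);
    for (unsigned i = 0; i < my_length; ++i) { destination[i] = my_fragment[i]; }
}

// Demo: thread t owns (t % 4) elements, all equal to t, stored at source + 4 * t.
__global__ void demo_kernel(int* output, unsigned* output_length, const int* source)
{
    const unsigned t = blockIdx.x * blockDim.x + threadIdx.x;
    append_disparate_to_global_memory(output, output_length, source + 4 * t, t % 4);
}

// Host-side helper: runs the demo with 64 threads and returns the appended data.
std::vector<int> append_demo()
{
    constexpr unsigned num_threads = 64, slots_per_thread = 4;
    std::vector<int> host_source(num_threads * slots_per_thread);
    for (unsigned i = 0; i < host_source.size(); ++i) {
        host_source[i] = static_cast<int>(i / slots_per_thread);
    }
    const size_t num_bytes = host_source.size() * sizeof(int);

    int* source = nullptr; int* output = nullptr; unsigned* length = nullptr;
    cudaMalloc(&source, num_bytes);
    cudaMalloc(&output, num_bytes);
    cudaMalloc(&length, sizeof(unsigned));
    cudaMemcpy(source, host_source.data(), num_bytes, cudaMemcpyHostToDevice);
    cudaMemset(length, 0, sizeof(unsigned));

    demo_kernel<<<1, num_threads>>>(output, length, source);

    unsigned host_length = 0;
    cudaMemcpy(&host_length, length, sizeof(unsigned), cudaMemcpyDeviceToHost);
    std::vector<int> result(host_length);
    cudaMemcpy(result.data(), output, host_length * sizeof(int), cudaMemcpyDeviceToHost);

    cudaFree(source); cudaFree(output); cudaFree(length);
    return result;
}
```

The order in which different warps land in the output is not deterministic, so only the multiset of appended elements is well-defined. For the in-registers variant, the pointer parameter would presumably become a by-value, fixed-capacity array plus a length, with the copy loop bounded by the compile-time cap so that it can be unrolled.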