Closed nandor closed 1 week ago
Since o_frag was not always aligned to a 16-byte boundary, memcpy implemented using 4x float moves was crashing in cuda-gdb when compiled with -G.
o_frag
memcpy
cuda-gdb
-G
Since
o_frag
was not always aligned to a 16-byte boundary,memcpy
implemented using 4x float moves was crashing incuda-gdb
when compiled with-G
.