Closed wphicks closed 3 years ago
Call cudaStreamSynchronize when syncing TritonTensor with output buffers in order to ensure correct output in cuda shared memory mode
cudaStreamSynchronize
TritonTensor
Resolve #80
Call
cudaStreamSynchronize
when syncingTritonTensor
with output buffers in order to ensure correct output in cuda shared memory modeResolve #80