Open benvanwerkhoven opened 8 years ago
The block-tiled implementation enables data reuse in GPU memory, but we can also reuse data on-chip. The next step is to write a kernel that does exactly this.
The block-tiled implementation enables data reuse in GPU memory, but we can also reuse data on-chip. The next step is to write a kernel that does exactly this.