Closed AlexBrownAMD closed 4 months ago
Is there any document which explains how this works? It is difficult to understand the behavior from asm code. Uploading to the corresponding ticket or any way is OK.
Is there any document which explains how this works? It is difficult to understand the behavior from asm code. Uploading to the corresponding ticket or any way is OK.
Longer description added to the ticket
Alternative implementation of the 2-tile algorithm that does DP tiles first and SK tiles after. This method should have a small boost in performance.