Closed rasbt closed 1 month ago
Adds shared buffers to avoid recreating the mask and cos, sin values in each transformer block
Check out this pull request on
See visual diffs & provide feedback on Jupyter Notebooks.
Powered by ReviewNB
Adds shared buffers to avoid recreating the mask and cos, sin values in each transformer block