Closed PeiqinZhuang closed 1 year ago
Hello, Thank tou for your interest.
We fixed the dimension for our purposes to get a minor speed improvement. You can modify the line #define DIM 32
and change it to 64 and recompile if you're interested in doing that.
We plan to release either a separate version of the kernel with dynamic dims, or merge that into the kernel directly.
Hi, I want to double check that no matter the value of dim and num_heads, the dim of head is always 32?
Hi, No, we specifically kept the per-head dim at 32 for our 4 variants, and extended heads for larger variants. That's why we kept it fixed in the kernel.
Just an update, You can now use arbitrary dims per head with v0.11 (PR #23 )
@PeiqinZhuang If that resolves your question, feel free to close the issue.
Just an update, You can now use arbitrary dims per head with v0.11 (PR #23 )
@PeiqinZhuang If that resolves your question, feel free to close the issue.
Hi, I have one question. Should I change the block size from 32 to 64, if I change the default dimension from 32 to 64.
Sorry, to what exactly are you referring by block size?
Closing this due to inactivity. If you still have questions feel free to open it back up.
Hi, I notice that right now the dimension of the head is fixed as 32 because of the constraint of the Cuda kernel. I wonder what if I change the dimension of the head to 64 since that figure in some codebases is set as 64.