Similar to #301 , in this PR we remove page_size from template parameters so that we can support any page_size for prefill kernels (previously we only support something like 1,4,8,16,32), as well as reduce binary size and accelerate compilation time.
Similar to #301 , in this PR we remove
page_size
from template parameters so that we can support anypage_size
for prefill kernels (previously we only support something like 1,4,8,16,32), as well as reduce binary size and accelerate compilation time.