Closed yzh119 closed 13 hours ago
gemma-style rmsnorm kernels (introduced in #477 ) are similar to original rmsnorm kernel, and we should use the same kernel for them. This PR cleans up duplicate code and unifies the kernels for gemma-style and original rmsnorm kernels.
The precision improvements (https://github.com/flashinfer-ai/flashinfer/pull/587, https://github.com/flashinfer-ai/flashinfer/pull/592) are kept in this PR.
gemma-style rmsnorm kernels (introduced in #477 ) are similar to original rmsnorm kernel, and we should use the same kernel for them. This PR cleans up duplicate code and unifies the kernels for gemma-style and original rmsnorm kernels.
The precision improvements (https://github.com/flashinfer-ai/flashinfer/pull/587, https://github.com/flashinfer-ai/flashinfer/pull/592) are kept in this PR.