Open ByronHsu opened 3 weeks ago
https://github.com/linkedin/Liger-Kernel/tree/main/examples/medusa
With the implementation of FusedLinearCrossEntropy and other kernels in Liger-Kernel, we are able to effectively reduce the memory while increase the throughput. We are happy to collaborate and integrate with our kernels!
cc @ctlllll @leeyeehoo @zhyncs
https://github.com/linkedin/Liger-Kernel/tree/main/examples/medusa
With the implementation of FusedLinearCrossEntropy and other kernels in Liger-Kernel, we are able to effectively reduce the memory while increase the throughput. We are happy to collaborate and integrate with our kernels!