jxiw / MambaInLlama

[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
https://arxiv.org/abs/2408.15237
Apache License 2.0
177 stars 13 forks source link

Where is the code of Speculative Decoding? #17

Closed cyj95 closed 1 day ago

cyj95 commented 5 days ago

I can’t find. Can you pinpoint the code?

jxiw commented 4 days ago

Hi, thank you for this question. Unfortunately, i don't have access to that. I work on distillation part. You can try to send an email to Daniele Paliotta for that. thanks.

itsdaniele commented 1 day ago

https://github.com/itsdaniele/speculative_mamba