FasterDecoding / Medusa

Medusa: Simple Framework for Accelerating LLM Generation with Multiple Decoding Heads
https://sites.google.com/view/medusa-llm
Apache License 2.0
2.28k stars 154 forks source link

HYDRA support? #90

Open arunpatala opened 7 months ago

arunpatala commented 7 months ago

Hi,

Thanks on this great work. Is there any plan to support HYDRA model which builds on medusa (https://arxiv.org/pdf/2402.05109.pdf)