Closed MichaelJayW closed 2 months ago
@simon-mo is this a feature you'd like to see implemented?
Is there any plan for implementing this feature? Will it occur in Q2 roadmap?
Yes this is planned to happen. After the speculative decoding framework is in.
https://sites.google.com/view/medusa-llm