basetenlabs / truss-examples

Examples of models deployable with Truss
https://trussml.com
MIT License
130 stars 37 forks source link

Medusa implementation #285

Closed aspctu closed 5 months ago

aspctu commented 5 months ago

This PR introduces a Truss that can run Medusa heads as part of TRT-LLM.