AutonomicPerfectionist / PipeInfer

PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation
MIT License
10 stars 3 forks source link