Infini-AI-Lab / Sequoia

scalable and robust tree-based speculative decoding algorithm
280 stars 29 forks source link