Closed KexinFeng closed 3 months ago
Once you get your acceptance rate, I think you can directly calculate the theoretical best gamma and speedup for vanilla speculative decoding.
Hello, you can use 4-chain, 8-chain in L40_growmaps as growmaps, which will produce "tree structures" for vanilla speculative decoding. Thank you!
Hi,
Thanks for the great work!
I'm wondering if there is any benchmark that compares Sequoia against vanilla speculative decoding?