frankxyy closed 8 months ago
The specdec results here are just using out of the box llama-7B.
So is there a figure for the average number of speculated tokens that pass verification?
Hi, I am wondering about the training process of the small model and its verification accuracy, since these have a large effect on decoding effectiveness. Thank you!
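To make the link between verification accuracy and decoding effectiveness concrete, here is a minimal sketch of the expected number of tokens produced per verification step. It assumes a simplified model in which each drafted token is accepted independently with the same probability `alpha`, and `gamma` tokens are drafted per step; both names and the function `expected_tokens_per_step` are illustrative, and none of the numbers are measurements from this thread.

```python
def expected_tokens_per_step(alpha: float, gamma: int) -> float:
    """Expected tokens emitted per verification step.

    Assumption (not from this thread): every drafted token is accepted
    independently with probability `alpha`, and `gamma` tokens are drafted.
    The count includes the one token the large model always contributes
    (either the correction after a rejection or the extra token when all
    drafts are accepted).
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must be in [0, 1]")
    if gamma < 0:
        raise ValueError("gamma must be non-negative")
    if alpha == 1.0:
        # Every draft is accepted, plus one extra token from the large model.
        return float(gamma + 1)
    # Geometric series: 1 + alpha + alpha^2 + ... + alpha^gamma
    return (1.0 - alpha ** (gamma + 1)) / (1.0 - alpha)


if __name__ == "__main__":
    # Hypothetical acceptance rates, chosen only to show the trend.
    for alpha in (0.5, 0.7, 0.9):
        print(alpha, expected_tokens_per_step(alpha, gamma=4))
```

Under this assumption, a draft model with a low acceptance rate gains little from drafting more tokens, which is why the quality of the small model matters so much.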