Closed daviswer closed 9 months ago
Add support for speculative decoding architecture, with implementations of distinct parallel pretraining and generative inference forward passes.
I made the Blacking and Snake_Casing changes but it looks like a maintainer needs to approve the automated tests again
Add support for speculative decoding architecture, with implementations of distinct parallel pretraining and generative inference forward passes.