Closed EricLBuehler closed 4 months ago
As described. The speculative decoding implementation is working, but should be sped up.
Work in #296.
As described. The speculative decoding implementation is working, but should be sped up.