Closed justheuristic closed 4 months ago
This pull request is not to be merged; it is used in a discussion with @xtinkt to compare two implementations for speculative decoding. It will be closed in a few days.