accuracy of next-token and next-next-token

hemingkx / Spec-Bench

Spec-Bench: A Comprehensive Benchmark and Unified Evaluation Platform for Speculative Decoding (ACL 2024 Findings)

Apache License 2.0

Thanks for your issue!

We plan to add token acceptance rate statistics in the near future, as noted in our roadmap. Implementing them presents two main challenges:

Draft Length Variability: Each method uses a different draft length, so collecting the statistics means reading and modifying the source code of every method individually. This is a considerable amount of work.
Token Tree Drafts: As mentioned in the EAGLE paper, the token acceptance rate is less meaningful for token tree drafts, because several candidate tokens are drafted at each position and at most one of them is accepted. We therefore need to design an evaluation framework that measures all methods in a consistent way.
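
To make the two challenges concrete, here is a minimal sketch of how such statistics could be computed from per-step counts. It is not Spec-Bench code: the `DraftStep` record, the function names, and all numbers are hypothetical and only illustrate why a pooled acceptance rate behaves differently for chain drafts and tree drafts.

```python
from dataclasses import dataclass


@dataclass
class DraftStep:
    """One draft-then-verify step (hypothetical record, not a Spec-Bench type)."""
    drafted: int   # candidate tokens proposed in this step (tree: all nodes)
    accepted: int  # drafted tokens that the target model accepted
    depth: int     # longest path that could be accepted (chain: same as drafted)


def acceptance_rate(steps):
    """Accepted tokens divided by drafted tokens, pooled over all steps."""
    drafted = sum(s.drafted for s in steps)
    return sum(s.accepted for s in steps) / drafted if drafted else 0.0


def depth_normalized_rate(steps):
    """Accepted tokens divided by the longest acceptable path, pooled over steps."""
    depth = sum(s.depth for s in steps)
    return sum(s.accepted for s in steps) / depth if depth else 0.0


def mean_accepted_tokens(steps):
    """Average number of drafted tokens accepted per verification step."""
    return sum(s.accepted for s in steps) / len(steps) if steps else 0.0


# Made-up numbers. The chain method varies its draft length between steps.
chain = [DraftStep(5, 3, 5), DraftStep(5, 5, 5), DraftStep(3, 1, 3)]
# The tree method drafts 26 nodes per step, but only one path of depth 5 can be kept.
tree = [DraftStep(26, 3, 5), DraftStep(26, 4, 5)]

for name, steps in (("chain", chain), ("tree", tree)):
    print(
        name,
        round(acceptance_rate(steps), 3),
        round(depth_normalized_rate(steps), 3),
        round(mean_accepted_tokens(steps), 3),
    )
```

In this toy data the tree method accepts more tokens per step than the chain method, yet its pooled acceptance rate is far lower because most tree nodes can never be accepted together. Normalizing by depth, or reporting mean accepted tokens per step, is one possible way to compare the two, under the assumptions above.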

We are working on this and will provide updates soon. Stay tuned!

accuracy of next-token and next-next-token #8