feifeibear / LLMSpeculativeSampling

Fast inference from large lauguage models via speculative decoding
Apache License 2.0
530 stars 51 forks source link

add time benchmarking and organize the directory better #10

Closed feifeibear closed 1 year ago