feifeibear / LLMSpeculativeSampling

Fast inference from large language models via speculative decoding
415 stars, 46 forks

add share_gpt benchmarking results #20

Closed feifeibear closed 9 months ago