srush / llama2.rs

A fast llama2 decoder in pure Rust.
MIT License
1.01k stars 56 forks source link

SIMD Tweaks #11

Closed srush closed 1 year ago

srush commented 1 year ago

Pull in some of the save simd tweaks from: https://github.com/srush/llama2.rs/pull/10

gives a small speed boost.