srush / llama2.rs

A fast llama2 decoder in pure Rust.
MIT License
995 stars 54 forks source link

Attempt to go even faster #10

Closed mfuntowicz closed 10 months ago

mfuntowicz commented 10 months ago

Two main things I did:

mfuntowicz commented 10 months ago

_(Dont merge this as it as it would certainly breaks other than x8664 build due to prefetch)

srush commented 10 months ago

Closing because portable port moved in.