psugihara / FreeChat

llama.cpp based AI chat app for macOS
https://www.freechat.run
MIT License
425 stars 37 forks source link

Bump llama.cpp, -ngl 99, tokens/second fix #71

Closed psugihara closed 4 months ago

psugihara commented 4 months ago

see #70

On my mbp I see ~24 tokens/sec with a llama 8B_5_K_M and GPU and ~13 without GPU on.

Screenshot 2024-05-27 at 9 32 21 AM Screenshot 2024-05-27 at 9 35 16 AM
vercel[bot] commented 4 months ago

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name Status Preview Comments Updated (UTC)
free-chat ✅ Ready (Inspect) Visit Preview 💬 Add feedback May 27, 2024 4:29pm