Added Llama 3 8B support

getumbrel / llama-gpt

A self-hosted, offline, ChatGPT-like chatbot. Powered by Llama 2. 100% private, with no data leaving your device. New: Code Llama support!

https://apps.umbrel.com/app/llama-gpt

MIT License

10.53k stars 666 forks source link

Open Ualas opened 2 months ago

Ualas commented 2 months ago

This pull request introduces support for Llama 3 8B model with GGUF format. The high-level changes are:

Updated llama-cpp-python to v0.2.63. The previous version was causing the API to crash.
Changed DEFAULT_MODEL to Llama 3 8B.
Added API and UI compatibility to the Llama 3 8B model.

gyzerok commented 2 months ago

@Ualas would it make sense to add 70b as well?