Run any Llama 2 locally with gradio UI on GPU or CPU from anywhere (Linux/Windows/Mac). Use `llama2-wrapper` as your local llama2 backend for Generative Agents/Apps.
MIT License
1.97k
stars
202
forks
source link
safetensors_rust.SafetensorError: Error while deserializing header: HeaderTooLarge #80
Running
python app.py