psugihara / FreeChat

llama.cpp based AI chat app for macOS
https://www.freechat.run
MIT License
425 stars 37 forks

Bubble up model loading errors to user #41

Closed verhovsky closed 9 months ago

verhovsky commented 9 months ago

If you load a model that's bigger than your RAM (starling-lm-7b-alpha.Q6_K.gguf on my 8 GB Mac) and ask it to generate text, the app hangs for a while and then crashes. With this change it should at least show an error message to the user. (Don't mind the tab bar; I don't know how it got there. It appeared while I was trying to modify the tab-hiding code and I can't get rid of it now, but it's not from this change.)

[Screenshot 2023-12-19 at 02.20.51]
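
For reference, a minimal sketch of how a model-loading failure could be surfaced as a SwiftUI alert. The error type, view, and message text below are illustrative assumptions, not FreeChat's actual API:

```swift
import SwiftUI

// Hypothetical error type for server/model failures (illustrative only).
enum LlamaServerError: LocalizedError {
  case modelLoadFailed(String)

  var errorDescription: String? {
    switch self {
    case .modelLoadFailed(let detail):
      return "Error loading model: \(detail)"
    }
  }
}

// Sketch of a view that presents the error as an alert instead of crashing.
struct ConversationView: View {
  @State private var serverError: LlamaServerError?

  var body: some View {
    VStack {
      Text("Chat goes here")  // placeholder for the real conversation UI
      Button("Simulate load failure") {
        serverError = .modelLoadFailed("model requires more memory than is available")
      }
    }
    .alert(
      "Server Error",
      isPresented: Binding(
        get: { serverError != nil },
        set: { if !$0 { serverError = nil } }
      ),
      presenting: serverError
    ) { _ in
      Button("OK", role: .cancel) {}
    } message: { error in
      Text(error.localizedDescription)
    }
  }
}
```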
vercel[bot] commented 9 months ago

The latest updates on your projects. Learn more about Vercel for Git ↗︎

| Name | Status | Preview | Comments | Updated (UTC) |
| --- | --- | --- | --- | --- |
| free-chat | ✅ Ready (Inspect) | Visit Preview | 💬 Add feedback | Dec 19, 2023 1:22am |
psugihara commented 9 months ago

cool, makes sense. better than crashing! it would be great to somehow capture the actual errors or server logs so users could report and/or understand them.
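
One way to capture those logs, sketched below on the assumption that the llama server is launched through Foundation's Process API; the class and property names here are hypothetical, not FreeChat's actual implementation:

```swift
import Foundation

// Minimal sketch of capturing the llama server's stderr so its output can be
// shown to the user or attached to a bug report.
final class ServerLogCapture {
  private(set) var serverLog = ""

  func startServer(executable: URL, arguments: [String]) throws -> Process {
    let process = Process()
    process.executableURL = executable
    process.arguments = arguments

    let stderrPipe = Pipe()
    process.standardError = stderrPipe

    // Accumulate stderr as it arrives so it's still available if the process dies.
    stderrPipe.fileHandleForReading.readabilityHandler = { [weak self] handle in
      let data = handle.availableData
      guard !data.isEmpty, let text = String(data: data, encoding: .utf8) else { return }
      DispatchQueue.main.async { self?.serverLog += text }
    }

    try process.run()
    return process
  }
}
```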

verhovsky commented 9 months ago

It seems like it crashes here while starting the server, because the llama process itself crashes:

https://github.com/psugihara/FreeChat/blob/13d0d3e627f8b96b3ea2550ae808c517e53452bd/mac/FreeChat/Models/NPC/LlamaServer.swift#L281
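
A rough sketch of how the startup wait could detect a dead process and throw instead of hanging; the error cases, health URL, and function name are assumptions, not the actual LlamaServer.swift code:

```swift
import Foundation

// Illustrative sketch only: while waiting for llama.cpp's server to come up,
// also check that the child process is still alive, and throw instead of
// hanging or crashing if it has already died (e.g. the model didn't fit in RAM).
enum ServerStartError: Error {
  case processExited(code: Int32)
  case timedOut
}

func waitForServer(_ process: Process,
                   healthURL: URL = URL(string: "http://127.0.0.1:8690/health")!,  // assumed port
                   timeout: TimeInterval = 60) async throws {
  let deadline = Date().addingTimeInterval(timeout)
  while Date() < deadline {
    // If the llama process died, surface its exit code instead of looping forever.
    if !process.isRunning {
      throw ServerStartError.processExited(code: process.terminationStatus)
    }
    if let (_, response) = try? await URLSession.shared.data(from: healthURL),
       (response as? HTTPURLResponse)?.statusCode == 200 {
      return  // server is up and ready
    }
    try await Task.sleep(nanoseconds: 500_000_000)  // wait half a second and retry
  }
  throw ServerStartError.timedOut
}
```

The caught error (exit code plus whatever stderr was captured) could then be passed to the UI layer, which ties back to showing the alert above.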