bug: insufficient handling of insufficient memory

janhq / cortex.cpp

Run and customize Local LLMs.

https://cortex.so

Apache License 2.0

1.97k stars, 111 forks

Open jlfranklin opened 2 weeks ago

jlfranklin commented 2 weeks ago

0.5.5

When there is insufficient memory to run the model, a lot of errors are written to the logs, and the returned text is complete gibberish.

Jan should stop the model and say something like, "sorry, my brain is full."

imtuyethan commented 2 weeks ago

The cortex.cpp team is working on this.

0xSage commented 5 days ago

Needed: proper error handling when a user attempts to load a model too big to fit in available memory.

Error message, e.g.: "Unable to load model due to insufficient system memory. xx needed, xx available."
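For illustration only, here is a minimal C++17 sketch of the kind of pre-load check requested above. It is not cortex.cpp's actual code: `FormatGiB` and `CheckModelFits` are hypothetical names, and the caller is assumed to already know the model's estimated memory requirement and the currently available system memory, both in bytes. How those two numbers are obtained is platform-specific and out of scope here.

```cpp
#include <cstdint>
#include <iomanip>
#include <optional>
#include <sstream>
#include <string>

// Hypothetical helper: formats a byte count as GiB with two decimals.
std::string FormatGiB(std::uint64_t bytes) {
  std::ostringstream out;
  out << std::fixed << std::setprecision(2)
      << static_cast<double>(bytes) / (1024.0 * 1024.0 * 1024.0) << " GiB";
  return out.str();
}

// Hypothetical pre-load check. Returns std::nullopt when the model fits,
// otherwise an error message in the format suggested in this issue.
// Both arguments are assumed to be supplied by the caller.
std::optional<std::string> CheckModelFits(std::uint64_t required_bytes,
                                          std::uint64_t available_bytes) {
  if (required_bytes <= available_bytes) {
    return std::nullopt;
  }
  return "Unable to load model due to insufficient system memory. " +
         FormatGiB(required_bytes) + " needed, " +
         FormatGiB(available_bytes) + " available.";
}
```

Under these assumptions, a loader would call `CheckModelFits` before allocating anything. If it returns a message, the loader would refuse to start the model and return that message, so the user never sees gibberish output.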

Feel free to reassign @vansangpfiev and move to a different sprint