Closed jnchman closed 1 year ago
Adds new GPU layer offload argument available in the latest version of llama.cpp
just did this! and added support for shortform flag too (ngl). thanks! (https://github.com/keldenl/gpt-llama.cpp/commit/942ed88773a9eb7cc8543d6d85a69695db893fc1)
ngl
Adds new GPU layer offload argument available in the latest version of llama.cpp