This PR encompasses several enhancements to usability and code refactoring. The primary changes include:
Skip compilation: You can skip compilation of llama.cpp shared library when running server with --install-pkgs. Just add --skip-compile option.
Removed auto process kill feature: Killing process when unloading model, was introduced to prevent the program from memory leak, but this sometimes make the program exit for no reason. So this feature is removed.
API key checker: API key checker will be activated if you start the server with option --api-key YOUR_API_KEY. Client must include Authorization header with Bearer YOUR_API_KEY.
This PR encompasses several enhancements to usability and code refactoring. The primary changes include:
--install-pkgs
. Just add--skip-compile
option.--api-key YOUR_API_KEY
. Client must includeAuthorization
header withBearer YOUR_API_KEY
.