I have encountered model loading failures 2 times with timeout errors when attempting to load models. The request to POST http://127.0.0.1:39291/v1/models/start times out consistently. Not with just Llama 8B, but also 3B.
Model loading requests are timing out during initialization with the error:
Request timed out: POST http://127.0.0.1:39291/v1/models/start
Jan version
0.5.8-731
Describe the Bug
I have encountered model loading failures 2 times with timeout errors when attempting to load models. The request to POST http://127.0.0.1:39291/v1/models/start times out consistently. Not with just Llama 8B, but also 3B.
Model loading requests are timing out during initialization with the error:
Server logs:
Steps to Reproduce
Screenshots / Logs
cortex.log app.log
OS: macOS (Darwin Kernel Version 23.2.0) Hardware: Apple M2 Jan Version: v0.5.8-731 Memory: 16GB Total Model: Llama 3.1 8B Instruct Q4 Cortex Version: v1.0.3-rc5
What is your OS?