janhq / cortex.cpp

Local AI API Platform
https://cortex.so
Apache License 2.0
2.09k stars 119 forks source link

bug: Concurrent chat doesnt work on Mac Silicon #1569

Closed gabrielle-ong closed 1 week ago

gabrielle-ong commented 2 weeks ago

Cortex version

1.0.1-203

Describe the Bug

Mac: Concurrent chats for the same model are queued up rather than parallel

May be related to n_parallel parameter in model.yaml

Windows, Ubuntu: Working as expected

Steps to Reproduce

No response

Screenshots / Logs

No response

What is your OS?

What engine are you running?

gabrielle-ong commented 2 weeks ago

@vansangpfiev do I need to change anything for this to work? I redownloaded the models, but still is non concurrent on my local com and the VM test-macos-13-1 ie right chat finishes, only then left chat begins Image

gabrielle-ong commented 1 week ago

works with n_parallel = 2, marking as complete