Open windkwbs opened 1 month ago
Use --multiuser
flag
--multiuser After checking, only queue execution is allowed and multiple tasks cannot be performed simultaneously
Yes, they will be executed in sequence. Koboldcpp does not allow parallel decoding.
Multi-task concurrency function is required, which is very important for API users. Similar to ollama function has been implemented