My operating system is Centos7
I download and use quay.io/go-skynet/local-ai:master-cublas-cuda12
So, I installed CDUA on the operating system that matches this image
I user ggml-model-q4_0.gguf(Llama2-13B-chat)
yaml file in the model folder
I use Debug mode to run the image of LocalAI
An error occurred when I used Postman to send inference
The error message for LocalAI is as follows
I can reason normally by running llama.cpp separately in the container
But there was an error running the test under go lama
......
@deadprogram @mauromorales @jrc2139 @soleblaze
My operating system is Centos7
I download and use quay.io/go-skynet/local-ai:master-cublas-cuda12
So, I installed CDUA on the operating system that matches this image
I user ggml-model-q4_0.gguf(Llama2-13B-chat)
yaml file in the model folder
I use Debug mode to run the image of LocalAI
An error occurred when I used Postman to send inference
The error message for LocalAI is as follows
![image](https://github.com/ggerganov/llama.cpp/assets/9493473/602384e4-7b0e-4c72-a69a-8201b766aa91)
I can reason normally by running llama.cpp separately in the container But there was an error running the test under go lama
......
@deadprogram @mauromorales @jrc2139 @soleblaze