It just reverts the llama cpp python server bc the 0.2.79 is the last version that actually works fine with vulkan
Screenshot / video of UI
N/A
What issues does this PR fix or reference?
it resolves #40
How to test this PR?
run the latest version of the vulkan image ghcr.io/containers/podman-desktop-extension-ai-lab-playground-images/ai-lab-playground-chat-vulkan:62b6f628ed77cf3f1518c32746e2e89d27072f0e and verify that it actually uses the cpu. The gpu detection is completely skipped.
You can use this command (update the model path)
2-b. if you do not want to build your own images you can use these below for testing using different version of llama_cpp
quay.io/lstocchi/vulkan:v4_279 -> llama_cpp 0.2.79
quay.io/lstocchi/vulkan:v4_280 -> llama_cpp 0.2.80
quay.io/lstocchi/vulkan:v4_284 -> llama_cpp 0.2.84
ghcr.io/containers/podman-desktop-extension-ai-lab-playground-images/ai-lab-playground-chat-vulkan:62b6f628ed77cf3f1518c32746e2e89d27072f0e -> llamacpp 0.2.85
quay.io/lstocchi/vulkan:v4_287 -> llama_cpp 0.2.87
What does this PR do?
It just reverts the llama cpp python server bc the 0.2.79 is the last version that actually works fine with vulkan
Screenshot / video of UI
N/A
What issues does this PR fix or reference?
it resolves #40
How to test this PR?
ghcr.io/containers/podman-desktop-extension-ai-lab-playground-images/ai-lab-playground-chat-vulkan:62b6f628ed77cf3f1518c32746e2e89d27072f0e
and verify that it actually uses the cpu. The gpu detection is completely skipped. You can use this command (update the model path)In the logs you should just have
2-b. if you do not want to build your own images you can use these below for testing using different version of llama_cpp
quay.io/lstocchi/vulkan:v4_279
-> llama_cpp 0.2.79quay.io/lstocchi/vulkan:v4_280
-> llama_cpp 0.2.80quay.io/lstocchi/vulkan:v4_284
-> llama_cpp 0.2.84ghcr.io/containers/podman-desktop-extension-ai-lab-playground-images/ai-lab-playground-chat-vulkan:62b6f628ed77cf3f1518c32746e2e89d27072f0e
-> llamacpp 0.2.85quay.io/lstocchi/vulkan:v4_287
-> llama_cpp 0.2.87