Closed Answer-is-not-42 closed 7 months ago
I'm having the same issue, and I think it's because the image embeddings take the place of the system prompt or something similar. I'm not sure whether it's directly related, but the image also gets re-encoded and added to the context every time the user sends a message, even without uploading the image again. I can't hold a "conversation" about the image; I only get the description once.
This issue was closed because it has been inactive for 14 days since being marked as stale.
Environment and Context
Windows 11, latest release of llama.cpp (cu12.2 build)
Failure Information
When using the server web interface with a multimodal model in multimodal mode (LLaVA 1.5 13B in my case), the prompt template set in the interface is not used. This does not happen when I don't send an image.
Also, prompts with images don't seem to be saved in the cache. I'm not sure whether that's related or whether I should open a separate issue.
Steps to Reproduce
./server -m ./models/llava-13b-q4_K.gguf --mmproj ./models/mmproj-llava-13b-f16.gguf -ngl 20 -v
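To check the behavior outside the web interface, it may help to build the request by hand so the template is applied explicitly. The following is only a sketch: it assumes the server's `/completion` endpoint accepts an `image_data` array referenced by an `[img-N]` placeholder in `prompt`, plus a `cache_prompt` flag, as described in the server README at the time. The function name `build_llava_request`, the system prompt text, and the placeholder image bytes are hypothetical.

```python
import base64
import json


def build_llava_request(image_bytes, user_message, image_id=10,
                        system_prompt=("A chat between a curious human and an "
                                       "artificial intelligence assistant.")):
    """Build a /completion payload with the prompt template written out by hand.

    Assumption: the server replaces "[img-<id>]" in the prompt with the
    embeddings of the image whose "id" matches in "image_data".
    """
    prompt = f"{system_prompt}\nUSER:[img-{image_id}]{user_message}\nASSISTANT:"
    return {
        "prompt": prompt,
        "image_data": [{
            # The image is sent as a base64 string, not raw bytes.
            "data": base64.b64encode(image_bytes).decode("ascii"),
            "id": image_id,
        }],
        # Ask the server to reuse the cached prompt prefix where it can.
        "cache_prompt": True,
        "n_predict": 128,
    }


# Placeholder bytes stand in for a real image file read with open(path, "rb").
payload = build_llava_request(b"\x89PNG placeholder bytes", "Describe the image.")
print(json.dumps(payload, indent=2))
```

Posting this JSON to the running server (by default it listens on `http://127.0.0.1:8080`, assuming no `--port` override) and comparing the prompt echoed in the `-v` console output against `payload["prompt"]` should show whether the template is dropped by the web interface or by the server itself.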
Failure Logs
Output from console when using multimodal capabilities (sending an image):
The default prompt is there. (Also, if I run the same prompt with the same image, both the image and the prompt get reprocessed.) Now, without sending an image, just asking the model to say "test":
No default prompt.