Vision models fail to process images

lmstudio-ai / lmstudio-bug-tracker

Bug tracking for the LM Studio desktop application

10 stars 3 forks source link

When loading an image into a vision model, the LLM will reply with some version of "there is no image" or "As an AI, I can only work with text". This happens even though the model has vision capabilities and LM Studio correctly detected that and surfaced the image upload button (not the clip/attachment button).

I am seeing this behavior no matter what my system prompt is (even empty). It also surfaces for both MLX and GGUF filetypes.

LM Studio version: 0.3.4 Hardware: 14" M3 Max 128Gb RAM

Example image I've been using: cover-i-think-the-callback-to-supa-hot-fire-on-the-v0-p89xtdx3oema1

Examples of output:

Please note the system prompt in this image was simply a test to try to force it after an empty system prompt did not work.

lmstudio-ai / lmstudio-bug-tracker

Vision models fail to process images #152