Why does the image reverse prompt word appear to be inconsistent with the actual image?

if-ai / ComfyUI-IF_AI_tools

ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.

Why does the image reverse prompt word appear to be inconsistent with the actual image? #41

Closed. shumoLR closed this issue 1 month ago.

shumoLR commented 1 month ago

[Image: QQ图片20240525233621]

Hello. In the image-to-text step, the generated text does not match the image (see the image above). Which setting is causing the problem?

if-ai commented 1 month ago

Yes, you need to use a LLaVA model or another model that supports vision, like the ones suggested in the repo README:

ollama run llava:7b-v1.6-mistral-q5_K_M
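To check outside ComfyUI that the model really reads the image, something like the following may help. It is only a rough sketch, not how the IF_AI_tools nodes talk to Ollama: it assumes Ollama is running at its default local address and sends the image to the `/api/generate` endpoint. The helper names `build_payload` and `describe_image` are made up for illustration.

```python
import base64
import json
import sys
import urllib.request

# Assumption: Ollama is listening on its default local address.
OLLAMA_URL = "http://localhost:11434/api/generate"
# The vision model named in the comment above.
MODEL = "llava:7b-v1.6-mistral-q5_K_M"


def build_payload(image_bytes: bytes, prompt: str, model: str = MODEL) -> dict:
    """Build the JSON body for a non-streaming image-description request."""
    return {
        "model": model,
        "prompt": prompt,
        # Ollama expects images as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
    }


def describe_image(path: str, prompt: str = "Describe this image.") -> str:
    """Send one image to the local Ollama server and return the model's text."""
    with open(path, "rb") as f:
        payload = build_payload(f.read(), prompt)
    req = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]


if __name__ == "__main__":
    # Usage: python check_vision.py path/to/image.png
    if len(sys.argv) > 1:
        print(describe_image(sys.argv[1]))
```

If the description returned here matches the picture but the node output still does not, the issue is more likely in the node's model selection than in the model itself.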

shumoLR commented 1 month ago

OK, thank you very much. I'll try it.