if-ai / ComfyUI-IF_AI_tools

ComfyUI-IF_AI_tools is a set of custom nodes for ComfyUI that allows you to generate prompts using a local Large Language Model (LLM) via Ollama. This tool enables you to enhance your image generation workflow by leveraging the power of language models.

Why is the prompt generated from an image (reverse prompt) inconsistent with the actual image? #41

Closed: shumoLR closed this issue 1 month ago

shumoLR commented 1 month ago

Hello, when I run image-to-text, the generated text does not match the image. Please see the screenshot below. Which setting do I have wrong?

[Screenshot: QQ图片20240525233621]

if-ai commented 1 month ago

Yes, you need to use a LLaVA model or another model that supports vision, like the ones suggested in the repo README:

ollama run llava:7b-v1.6-mistral-q5_K_M
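To check outside ComfyUI that the vision model actually describes your image, something like the following may help. It is a minimal sketch that calls Ollama's local HTTP API directly, not the code the IF_AI_tools node runs. It assumes Ollama is serving on its default port 11434 and that the model tag above has already been pulled. The helper names `build_payload` and `describe_image` are made up for illustration.

```python
import base64
import json
import urllib.request

# Assumption: Ollama is running locally on its default port.
OLLAMA_URL = "http://localhost:11434/api/generate"
# Model tag taken from the reply above; any vision-capable model should work.
MODEL = "llava:7b-v1.6-mistral-q5_K_M"


def build_payload(image_bytes: bytes, prompt: str, model: str = MODEL) -> dict:
    """Build a non-streaming /api/generate request body with one attached image."""
    return {
        "model": model,
        "prompt": prompt,
        # Ollama expects images as base64-encoded strings.
        "images": [base64.b64encode(image_bytes).decode("ascii")],
        "stream": False,
    }


def describe_image(path: str, prompt: str = "Describe this image in detail.") -> str:
    """Send the image at `path` to the local Ollama server and return its text reply."""
    with open(path, "rb") as f:
        payload = build_payload(f.read(), prompt)
    req = urllib.request.Request(
        OLLAMA_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.loads(resp.read())["response"]
```

Calling `describe_image("your_image.png")` with a path of your own should return a description that matches the picture when a vision model is loaded. If the description is still unrelated to the image, the model selected is probably text-only.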

shumoLR commented 1 month ago

OK, thank you very much. I'll try it.