Qwen2-VL-Instruct has been integrated into ComfyUI and runs smoothly, supporting (but not limited to) text-based queries, video queries, single-image queries, and multi-image queries for generating captions or responses.
Qwen has released several GPTQ-quantized models:
- Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int4
- Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int4
- Qwen/Qwen2-VL-2B-Instruct-GPTQ-Int8
- Qwen/Qwen2-VL-7B-Instruct-GPTQ-Int8
Since they are pre-quantized, they could save a lot of disk space. Please add support for the Qwen2-VL-GPTQ models!
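For reference, here is a rough sketch of how a quantization option could map onto the checkpoints listed above. The helper names `resolve_repo_id` and `load_model` are hypothetical and not part of this node's code; the loading step assumes the standard `transformers` classes for Qwen2-VL, and GPTQ checkpoints will likely also need a GPTQ backend (for example `auto-gptq` with `optimum`) installed.

```python
from typing import Optional

# Hypothetical option values; only the resulting repo IDs come from the list above.
MODEL_SIZES = {"2B", "7B"}
QUANTIZATIONS = {"GPTQ-Int4", "GPTQ-Int8"}


def resolve_repo_id(size: str, quantization: Optional[str] = None) -> str:
    """Build the Hugging Face repo ID for a Qwen2-VL-Instruct checkpoint."""
    if size not in MODEL_SIZES:
        raise ValueError(f"unsupported model size: {size!r}")
    repo_id = f"Qwen/Qwen2-VL-{size}-Instruct"
    if quantization is None:
        return repo_id  # unquantized checkpoint
    if quantization not in QUANTIZATIONS:
        raise ValueError(f"unsupported quantization: {quantization!r}")
    return f"{repo_id}-{quantization}"


def load_model(repo_id: str):
    """Load model and processor (assumes transformers with Qwen2-VL support)."""
    # Imported lazily so resolve_repo_id works without transformers installed.
    from transformers import AutoProcessor, Qwen2VLForConditionalGeneration

    model = Qwen2VLForConditionalGeneration.from_pretrained(
        repo_id, torch_dtype="auto", device_map="auto"
    )
    processor = AutoProcessor.from_pretrained(repo_id)
    return model, processor
```

With something like this, a dropdown in the node could pass `resolve_repo_id("7B", "GPTQ-Int4")` to the loader instead of a hard-coded model name.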
Thank you!