Open qnguyen3 opened 10 months ago
Same question here
It's not the top priority, but we would do that.
please make it the priority @simonJJJ
@simonJJJ can you tell us otherwise how to increase throughput on qwen-vl-chat-int4 any optmization techniques please
I’m also interested
Are we going to get GGML/GGUF version of Qwen-VL?