-
Do you have data on the performance of DPO with models other than Qwen-VL-Chat? I found that it degrades both perception and cognition in MME when used with LLaVA-1.5.
-
### Feature request
Adding the ability to pass many images per prompt to PaliGemma. This would mean, among other changes, to change the argument type of `images` on PaliGemmaProcessor to allow array[…
-
![image](https://github.com/LLaVA-VL/LLaVA-NeXT/assets/55685981/3c9059ea-95be-41c7-a555-d2ab407f374f)
-
Hello, I have been try to use the chatbot_ros project with the explicability_ros project and decided that I wanted to try to get it to describe images. I chased my tail on trying to get a proper promp…
-
AttributeError: 'Image' object has no attribute 'shape'
-
Thanks for your great job!When will open source finetune code?
-
[[Open issues - help wanted!]](https://github.com/vllm-project/vllm/issues/4194#issuecomment-2102487467)
**Update [9/8] - We have finished majority of the refactoring and made extensive progress fo…
-
Hi, was just testing to see if I could reform the same results from your demo as in an import code. I was attempting to prompt two images and then ask for comparisons. The demo performs this very well…
-
### Describe the issue
Thanks for your great work!!
I want to load the model and input multiple images to save the description for each image.
Is there a way to load the model and then complete t…
-
加载模型后执行,第一次推理没问题,同样的参数第二次推理,就会报错
另外:咱们是否有官方技术交流群,可以讨论一下呢