-
I followed the setup in https://github.com/open-compass/VLMEvalKit/tree/main for Qwen.
Qwen-VL-Chat directly outputs the answer instead of a letter choice. Did you use any customized prompt or did p…
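For context, one possible workaround is to map the free-form answer back to an option letter after generation. The sketch below is only an illustration under assumptions, not VLMEvalKit's actual evaluation logic: `match_choice` and the `options` dictionary are hypothetical names, and the matching rule (a leading letter, otherwise a unique substring match) is a simple heuristic.

```python
import re
from typing import Dict, Optional


def match_choice(prediction: str, options: Dict[str, str]) -> Optional[str]:
    """Map a free-form answer to an option letter, or None if unclear.

    Hypothetical helper for illustration only.
    """
    text = prediction.strip()
    # Case 1: the answer is already a letter such as "B", "B." or "(B)".
    # A letter followed by a space (e.g. "A cat") is deliberately not
    # treated as a letter choice.
    m = re.match(r"^\(?([A-Z])\)?(?:[.:]|$)", text)
    if m and m.group(1) in options:
        return m.group(1)
    # Case 2: exactly one option's text appears in the answer.
    lowered = text.lower()
    hits = [key for key, value in options.items() if value.lower() in lowered]
    return hits[0] if len(hits) == 1 else None


options = {"A": "a cat", "B": "a dog"}
print(match_choice("The image shows a dog.", options))
print(match_choice("(B)", options))
```

Answers that match no option, or more than one, return `None` so they can be inspected by hand rather than silently scored.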
-
Thank you for your wonderful work! I have been following the demo instructions and successfully launched the controller and the Gradio web server. However, I encountered an issue when trying to launch…
-
Model name: LLaVA-NeXT-Video-7B
llava/model/llava_arch.py", line 309, in prepare_inputs_labels_for_multimodal
image_feature = unpad_image(image_feature, image_sizes[image_idx])
TypeError: 'No…
-
https://github.com/LLaVA-VL/LLaVA-NeXT/blob/main/scripts/train/finetune_onevision.sh
Is this the script for SFT? Where can we find the following checkpoint for finetuning?
`"/checkpoints/projecto…
-
Hello. Thanks for your excellent work!
Earlier, I reproduced LLaVA-NeXT-Image training and got the desired performance, and I am now trying to reproduce LLaVA-NeXT-Interleave training. I would like…
-
If I want to work with multimodal LLMs that take in a set of embeddings from vision/audio encoders, what is the proper way of inputting them into an LLM running on exllamav2?
Can I just add a custo…
-
After running all the necessary commands, the demo doesn't display the chat interface as expected. It appears to be stuck, and the chat UI is not visible despite the code containing the relevant imple…
-
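This is not for promotion. People may have small questions that don't warrant an issue, so a group chat would be helpful. If the official team needs it later, I am willing to hand over group administration.
My WeChat ID is dreamingforhope. You can add me if the QR code has expired.
-----------------------------------
The first group has reached 200 members, so a second group has been opened.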
![image](https://github.com/user-attachments/as…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related issue y…
-
Where is the llava_1_6.json referred to in `scripts/train/finetune_clip.sh`?