-
Dear Yuhui Zhang,
Thank you for your great effort! I found your paper very interesting and informative.
Could you provide us with the pretrained weights for fine-tuned VLMs? It would be a tremen…
-
- [x] MiniCPM-Llama3-V-2_5
- [x] Florence 2
- [ ] MoonDream2
- [ ] Yi-VL
- [x] Llava Next
- [ ] CuMo
- [ ] Kosmos-2.5
Instructions:
1. Select the model and comment below with your selection
…
-
Thanks for sharing your great work!
I have a few questions about your work, especially regarding the baselines.
1. Did you fine-tune the VLMs reported in Table 1? I got confused because Section 3.…
-
Hi, does the AWQ algorithm look at activations computed on a prompt dataset? If so, wouldn't quantized VLMs be inaccurate because the vision embeddings are missing from that data?
-
### Feature request
Add support for exporting SigLIP models
### Motivation
SigLIP is used by many SOTA VLMs and is gaining traction, so supporting it can be the first step toward supporting many VLMs.
### Your …
-
### Model description
Hi! I'm the author of ["Prismatic VLMs"](https://github.com/TRI-ML/prismatic-vlms), our upcoming ICML paper that introduces and ablates design choices of visually-conditioned …
siddk updated 1 month ago
-
For anyone who has gotten VLMs to work in FastChat: how did you do so? I cannot even pull any LLaVA model from Hugging Face successfully. These have been my results so far:
```
python -m fastchat.s…
-
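The evaluation module section of the documentation says two evaluation-set patterns are supported: the multiple-choice format (CEval) and the question-answering format (General-QA). The General-QA format does not currently support images. Are there plans to add VLM evaluation in the future?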
Wimen updated 2 weeks ago
-
Both are modern, performant models and would be very useful for internal use because of their licenses.
https://huggingface.co/tiiuae/falcon-11B-vlm
renos updated 3 weeks ago
-
timm v1.0.3 was just released 2 hours ago (https://github.com/huggingface/pytorch-image-models/releases/tag/v1.0.3) and it seems like they've reworked the API for `forward_intermediates()` and it retu…