-
This issue tracks progress on improving the handling and testing of Vision-Language Models. The main goals are to enhance and enable generation tests, and to handle other generation techniques such as assisted …
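For context, the kind of check involved here is a short generation smoke test. A minimal sketch, assuming a LLaVA-1.5 checkpoint as a stand-in and plain `transformers` APIs (the real test suite's fixtures and model list will differ):
```
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

CHECKPOINT = "llava-hf/llava-1.5-7b-hf"  # stand-in; the tracker spans many VLMs

def test_generate_smoke():
    processor = AutoProcessor.from_pretrained(CHECKPOINT)
    model = LlavaForConditionalGeneration.from_pretrained(
        CHECKPOINT, torch_dtype=torch.float16, device_map="auto"
    )
    image = Image.new("RGB", (336, 336))  # a blank image is enough for a smoke test
    prompt = "USER: <image>\nDescribe the image.\nASSISTANT:"
    # cast float inputs (pixel_values) to the model dtype to avoid a mismatch
    inputs = processor(images=image, text=prompt, return_tensors="pt").to(
        model.device, torch.float16
    )
    out = model.generate(**inputs, max_new_tokens=8, do_sample=False)
    assert out.shape[0] == 1  # one decoded sequence back, no crash
```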
-
### Question Validation
- [X] I have searched both the documentation and Discord for an answer.
### Question
I want to use "Qwen/Qwen2-VL-2B-Instruct" in my multimodal RAG app. I tried OllamaMultiM…
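For reference, a minimal sketch of calling this checkpoint directly through `transformers` (a recent version with Qwen2-VL support), bypassing the LlamaIndex wrapper; the image path and question are placeholders:
```
from PIL import Image
from transformers import AutoProcessor, Qwen2VLForConditionalGeneration

model_id = "Qwen/Qwen2-VL-2B-Instruct"
model = Qwen2VLForConditionalGeneration.from_pretrained(
    model_id, torch_dtype="auto", device_map="auto"
)
processor = AutoProcessor.from_pretrained(model_id)

image = Image.open("page.png")  # placeholder document image
messages = [{"role": "user", "content": [
    {"type": "image"},
    {"type": "text", "text": "Summarize this page."},
]}]
# render the chat template, attach the image, then generate
prompt = processor.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
inputs = processor(text=[prompt], images=[image], return_tensors="pt").to(model.device)
out = model.generate(**inputs, max_new_tokens=256)
print(processor.batch_decode(out[:, inputs.input_ids.shape[1]:], skip_special_tokens=True)[0])
```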
-
### Checklist
- [X] 1. If the issue you raised is not a feature but a question, please raise a discussion at https://github.com/sgl-project/sglang/discussions/new/choose Otherwise, it will be closed.…
-
When will it be possible to fine-tune Qwen2-VL (or other VLMs) using Unsloth? :)
-
Since we now support the multi-turn benchmark MMDU, we would like to implement the `chat_inner` function for existing VLMs in VLMEvalKit to add support for multi-turn chatting.
Currently, we hav…
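For illustration, a hedged sketch of what such a `chat_inner` could look like for a transformers-backed VLM; the message schema (role/content turns with `type`/`value` items) and the `self.model` / `self.processor` attributes are assumptions about the wrapper, not VLMEvalKit's confirmed interface:
```
from PIL import Image

# Hypothetical method on a model wrapper that already holds a transformers
# model and processor; signature and schema are assumed, not confirmed.
def chat_inner(self, message, dataset=None):
    conversation, images = [], []
    for turn in message:
        parts = []
        for item in turn["content"]:
            if item["type"] == "image":
                images.append(Image.open(item["value"]))
                parts.append({"type": "image"})
            else:
                parts.append({"type": "text", "text": item["value"]})
        conversation.append({"role": turn["role"], "content": parts})
    # render the full multi-turn history with the model's chat template
    prompt = self.processor.apply_chat_template(
        conversation, tokenize=False, add_generation_prompt=True
    )
    inputs = self.processor(
        text=[prompt], images=images or None, return_tensors="pt"
    ).to(self.model.device)
    out = self.model.generate(**inputs, max_new_tokens=512)
    # return only the newly generated turn, not the echoed prompt
    return self.processor.batch_decode(
        out[:, inputs.input_ids.shape[1]:], skip_special_tokens=True
    )[0]
```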
-
**Is your feature request related to a problem? Please describe.**
Vision-Language Models are useful for image-based understanding. In robotics, the environment is dynamic and images from camera …
-
Hi,
Thank you for your great work!
I've been trying to use the Phi-3-Instruct-4B VLM models, but encountered several issues:
- Incorrect LLM backbone choice in phi.py:
https://github.com/R…
-
### Model description
Hi! I'm the author of ["Prismatic VLMs"](https://github.com/TRI-ML/prismatic-vlms), our upcoming ICML paper that introduces and ablates design choices of visually-conditioned …
-
For anyone who has gotten VLMs to work in FastChat: how did you do so? I cannot even pull any LLaVA model from Hugging Face successfully. These have been my results so far:
```
python -m fastchat.s…
```
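One way to narrow this down is to pull the weights with `huggingface_hub` alone, independent of FastChat; a small sketch, assuming `liuhaotian/llava-v1.5-7b` as the target checkpoint:
```
from huggingface_hub import snapshot_download

# Pull the checkpoint directly; if this fails too, the problem is Hub access
# (auth, network, repo id), not FastChat itself.
local_path = snapshot_download(repo_id="liuhaotian/llava-v1.5-7b")
print("weights downloaded to:", local_path)
```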
-
### Feature request
Add support for exporting SigLIP models
### Motivation
As used by many SOTA VLMs, SigLIP is gaining traction, and supporting it can be step one toward supporting many VLMs.
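For illustration, a minimal sketch of exporting the SigLIP vision tower with plain `torch.onnx.export`, assuming ONNX is the export target; the wrapper class, checkpoint, and file name are illustrative, and a full exporter would also need to cover the text tower and config metadata:
```
import torch
from transformers import SiglipVisionModel

class VisionTower(torch.nn.Module):
    def __init__(self, model):
        super().__init__()
        self.model = model

    def forward(self, pixel_values):
        # return a plain tensor so the ONNX trace has a fixed output structure
        return self.model(pixel_values=pixel_values).pooler_output

tower = VisionTower(SiglipVisionModel.from_pretrained("google/siglip-base-patch16-224").eval())
dummy = torch.randn(1, 3, 224, 224)
torch.onnx.export(
    tower, (dummy,), "siglip_vision.onnx",
    input_names=["pixel_values"], output_names=["pooled_embedding"],
    dynamic_axes={"pixel_values": {0: "batch"}},
)
```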
### Your …