-
### 🚀 The feature
Add support for vision-language models like CLIP or LiT.
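For context, a CLIP-style model trains an image encoder and a text encoder jointly with a symmetric contrastive (InfoNCE) objective over matched image/text pairs. A minimal sketch of that loss (the `clip_contrastive_loss` helper and the embedding sizes are illustrative, not torchvision API):

```python
import torch
import torch.nn.functional as F

def clip_contrastive_loss(image_emb, text_emb, temperature=0.07):
    """Symmetric InfoNCE loss; row i of each tensor is a matched image/text pair."""
    image_emb = F.normalize(image_emb, dim=-1)
    text_emb = F.normalize(text_emb, dim=-1)
    logits = image_emb @ text_emb.t() / temperature  # (N, N) similarity matrix
    targets = torch.arange(logits.size(0))           # matched pairs on the diagonal
    loss_i = F.cross_entropy(logits, targets)        # image -> text direction
    loss_t = F.cross_entropy(logits.t(), targets)    # text -> image direction
    return (loss_i + loss_t) / 2

# Stand-in embeddings; in practice these come from a vision backbone and a text encoder.
loss = clip_contrastive_loss(torch.randn(4, 64), torch.randn(4, 64))
```

In a real implementation the two encoders would be a vision backbone and a text transformer producing these embeddings; LiT additionally keeps the image tower frozen.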
### Motivation, pitch
Dear torchvision team,
I am sorry if I missed discussions about this or a specific reason why you h…
-
### Motivation
Hi friends,
I'm opening this issue as a place to discuss small vision-language models; please share your thoughts below!
There's recently been great success in research with sm…
-
Hi,
I'm trying to constrain the generation of my VLMs using this repo; however, I can't figure out how to customize the pipeline for handling inputs (query + image). Whereas it is documented as …
-
## Links
[paper](https://arxiv.org/abs/2405.02246)
[models](https://huggingface.co/HuggingFaceM4/idefics2-8b)…
-
### System Info
In the current implementation of VLMs, the `_supports_sdpa` attribute checks for and activates SDPA attention only for the language model. For example, in [Llava](https://github.com/huggingf…
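SDPA here refers to PyTorch's fused `torch.nn.functional.scaled_dot_product_attention`, which replaces the manual softmax-attention computation with a fused kernel. A minimal sketch showing it matches plain "eager" attention numerically (tensor shapes are illustrative):

```python
import torch
import torch.nn.functional as F

# (batch, heads, seq_len, head_dim), as attention layers typically lay tensors out
q = torch.randn(1, 4, 8, 16)
k = torch.randn(1, 4, 8, 16)
v = torch.randn(1, 4, 8, 16)

# Fused kernel path (the path the _supports_sdpa flag gates)
out_sdpa = F.scaled_dot_product_attention(q, k, v)

# Numerically equivalent "eager" attention, written out by hand
scores = (q @ k.transpose(-2, -1)) / (16 ** 0.5)
out_eager = torch.softmax(scores, dim=-1) @ v
```

The issue's point is that the vision tower would benefit from the same fused path, not just the language model.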
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
### Describe the bug
Hi folks, thanks for t…
-
### This issue is for a: (mark with an `x`)
```
- [ ] bug report -> please search issues before submitting
- [X] feature request
- [ ] documentation issue or request
- [ ] regression (a behavior …
-
TRL's SFTTrainer supports LLaVA (Large Language and Vision Assistant), as described in [Vision Language Models Explained](https://huggingface.co/blog/vlms).
Is there any plan to rele…
-
A significant achievement in aligning vision-language models!
While running `RLAIF-V/muffin/train/train_llava15.py`, I noticed that all model parameters are trainable. Due to hardware limi…
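A common workaround under tight hardware budgets is to freeze most of the model and train only a small module, such as the multimodal projector. A toy sketch of that pattern (the `TinyVLM` class and its submodule names are illustrative, not the RLAIF-V code):

```python
import torch.nn as nn

class TinyVLM(nn.Module):
    """Toy stand-in for a VLM: vision encoder -> projector -> language model."""
    def __init__(self):
        super().__init__()
        self.vision_tower = nn.Linear(32, 16)   # stand-in "vision encoder"
        self.projector = nn.Linear(16, 16)      # multimodal projector
        self.language_model = nn.Linear(16, 8)  # stand-in "LLM"

model = TinyVLM()

# Freeze everything, then unfreeze only the projector
for p in model.parameters():
    p.requires_grad = False
for p in model.projector.parameters():
    p.requires_grad = True

trainable = sum(p.numel() for p in model.parameters() if p.requires_grad)
total = sum(p.numel() for p in model.parameters())
```

Only parameters with `requires_grad=True` receive gradients, so the optimizer updates just the projector while the frozen towers act as fixed feature extractors.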
-
### Feature request
The developments in the robotics community around RT-2 show a lot of potential for VLMs, but the hardware constraints faced by small developers make it difficult to deploy RT-2-level p…