vision-language-model Search Results

1000+ results
for vision-language-model

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

turboderp/exllamav2 #658

[REQUEST] Llama 3.2 Vision Support (or already exists?)

### Problem Wondering if basic support already exists. Llama vision 3.2 is unlike https://github.com/turboderp/exllamav2/issues/399, and in some ways may be very easy for basic Exllama integration…

grimulkan updated 1 week ago
13
vllm-project/vllm #6682

[Bug]: CUDA OOM error when loading another model after exiti…

### Your current environment ```text Collecting environment information... PyTorch version: 2.3.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A …

R-C101 updated 4 weeks ago
5
huggingface/optimum-benchmark #295

Vision language model support

Hello! 💗 When trying to run benchmarks on vision language models (image-text-to-text) I realized this library doesn't support this task. It would be nice to have a support for it since these models ar…

merveenoyan updated 5 days ago
1
google-ai-edge/mediapipe #5690

GPU mode (all tasks) fails to initialize on Nvidia Jetson (a…

### Have I written custom code (as opposed to using a stock example script provided in MediaPipe) Yes ### OS Platform and Distribution Ubuntu 22.04, arm64, Jetpack 6.0, CUDA 12.2 ### Progr…

JC3 updated 1 month ago
3
thu-ml/RoboticsDiffusionTransformer #5

How to run inference of the model with a single image and no…

Hi there! Thank you for your great research and open-source contributions. I just have a few questions about running your model. ## What I am trying to do I am trying to run RDT on the [Sim…

alik-git updated 1 month ago
7
huggingface/trl #2097

Supports of PPOTrainer / DPOTrainer for Qwen2Audio

### Feature request Enable PPOTrainer and DPOTrainer to work with audio-language models like Qwen2Audio. Architecture for this model is identical to vision-language models like LlaVa, consisting of…

jonflynng updated 1 month ago
2
huggingface/trl #2136

[SFT VLM] Add support for Molmo models

### Feature request Extend the `sft_vlm.py` script to support the new Molmo models from AllenAI: https://huggingface.co/collections/allenai/molmo-66f379e6fe3b8ef090a8ca19 Paper: https://arxiv.org/…

lewtun updated 1 month ago
15
vllm-project/vllm #4194

[RFC]: Multi-modality Support Refactoring

[[Open issues - help wanted!]](https://github.com/vllm-project/vllm/issues/4194#issuecomment-2102487467) **Update [11/18] - In the upcoming months, we will focus on performance optimization for mul…

ywang96 updated 11 hours ago
91
huggingface/transformers #34467

Assert error in convert_llava_onevision_weights_to_hf.py

### System Info - `transformers` version: 4.46.0 - Platform: Linux-5.15.0-97-generic-x86_64-with-glibc2.35 - Python version: 3.12.3 - Huggingface_hub version: 0.26.1 - Safetensors version: 0.4.…

FuryMartin updated 2 days ago
20
merplumander/ai-forecasting #1

Language Models and Knowledge Cut-offs

# Language Model Overview ## OpenAI | | gpt-4o | gpt-4o-mini …

merplumander updated 1 week ago
2

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for vision-language-model

1000+ results
for vision-language-model