-
### Problem
Wondering if basic support already exists.
Llama 3.2 Vision is unlike https://github.com/turboderp/exllamav2/issues/399, and in some ways may be very easy for basic Exllama integration…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
```
-
Hello! 💗 When trying to run benchmarks on vision-language models (image-text-to-text), I realized this library doesn't support this task. It would be nice to have support for it since these models ar…
-
### Have I written custom code (as opposed to using a stock example script provided in MediaPipe)
Yes
### OS Platform and Distribution
Ubuntu 22.04, arm64, JetPack 6.0, CUDA 12.2
### Progr…
-
Hi there!
Thank you for your great research and open-source contributions.
I just have a few questions about running your model.
## What I am trying to do
I am trying to run RDT on the [Sim…
-
### Feature request
Enable PPOTrainer and DPOTrainer to work with audio-language models like Qwen2Audio. The architecture of this model is identical to that of vision-language models like LLaVA, consisting of…
-
### Feature request
Extend the `sft_vlm.py` script to support the new Molmo models from AllenAI: https://huggingface.co/collections/allenai/molmo-66f379e6fe3b8ef090a8ca19
Paper: https://arxiv.org/…
-
[[Open issues - help wanted!]](https://github.com/vllm-project/vllm/issues/4194#issuecomment-2102487467)
**Update [11/18] - In the upcoming months, we will focus on performance optimization for mul…
-
### System Info
- `transformers` version: 4.46.0
- Platform: Linux-5.15.0-97-generic-x86_64-with-glibc2.35
- Python version: 3.12.3
- Huggingface_hub version: 0.26.1
- Safetensors version: 0.4.…
-
# Language Model Overview
## OpenAI
| | gpt-4o | gpt-4o-mini …