-
```python
from transformers import Qwen2VLForConditionalGeneration, AutoTokenizer, AutoProcessor
from qwen_vl_utils import process_vision_info
import torch

model = Qwen2VLForConditionalGeneration.from_…
```
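For context, `process_vision_info` consumes chat-style messages whose `content` interleaves image and text entries. A minimal sketch of building such a message; the structure follows the Qwen2-VL model card examples, and the helper name is illustrative:

```python
def build_vl_message(image_source, prompt):
    """Build a single user turn in the structure Qwen2-VL examples use:
    a list of messages whose 'content' interleaves image and text entries.
    The keys ('type', 'image', 'text') follow the model card; treat them
    as an assumption and verify against the official example."""
    return [
        {
            "role": "user",
            "content": [
                {"type": "image", "image": image_source},
                {"type": "text", "text": prompt},
            ],
        }
    ]

messages = build_vl_message("file:///path/to/demo.jpg", "Describe this image.")
print(messages[0]["content"][1]["text"])
```

The resulting list is what would be passed to the processor's chat template and to `process_vision_info(messages)`.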
-
### Your current environment
```python
from PIL import Image
from transformers import AutoProcessor
from vllm import LLM, SamplingParams
from qwen_vl_utils import process_vision_info

MODEL_PATH = '/w…
```
-
Hi, I am getting the following error when executing `trainer.train()`:
```
---------------------------------------------------------------------------
TypeError                                 Traceback (most re…
```
-
Hi,
I am encountering an issue when running inference on the Llama-3-VILA1.5-8B model. The error message I receive is:
```
RuntimeError: FlashAttention only supports Ampere GPUs or newer.
```
I…
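For background: this error means the GPU's CUDA compute capability is below 8.0 (Ampere), the minimum FlashAttention requires. A minimal sketch of the check, assuming the (major, minor) tuple comes from `torch.cuda.get_device_capability(0)`:

```python
def supports_flash_attention(capability):
    """Return True if a (major, minor) CUDA compute capability is Ampere (8.0)
    or newer, the minimum FlashAttention requires.
    In practice the tuple would come from torch.cuda.get_device_capability(0)."""
    return tuple(capability) >= (8, 0)

print(supports_flash_attention((7, 5)))  # Turing (e.g. T4): False
print(supports_flash_attention((8, 0)))  # Ampere (e.g. A100): True
```

On pre-Ampere GPUs, the usual workaround is to fall back to a non-flash attention backend (e.g. eager or SDPA attention), though the exact option depends on the codebase.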
-
Hi,
Thanks for this great work!
In 🤗 Transformers, we support the [Vision Transformer (ViT)](https://huggingface.co/docs/transformers/model_doc/vit) - among many other models like [MAE](https://…
-
1. Public code and paper link:
I have installed the following code: https://github.com/AILab-CVC/GroupMixFormer
Paper link: https://arxiv.org/abs/2311.15157
2. What does this work d…
-
Description
Welcome to the 'DSWP' Team, good to see you here.
With this issue, readers will be introduced to the core ideas behind 'Vision Transformers', along with sample code completely in …
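As a taste of the core idea behind Vision Transformers (splitting an image into fixed-size patches that are flattened into tokens), here is a minimal sketch. The 224×224 image and 16×16 patch sizes follow the original ViT setup; the helper name is illustrative:

```python
import numpy as np

def patchify(image, patch_size):
    """Split an (H, W, C) image into non-overlapping flattened patches,
    the first step of a Vision Transformer before linear embedding."""
    h, w, c = image.shape
    assert h % patch_size == 0 and w % patch_size == 0, "image must tile evenly"
    # (H, W, C) -> (H/p, p, W/p, p, C) -> (H/p, W/p, p, p, C) -> (num_patches, p*p*C)
    patches = image.reshape(h // patch_size, patch_size, w // patch_size, patch_size, c)
    patches = patches.transpose(0, 2, 1, 3, 4).reshape(-1, patch_size * patch_size * c)
    return patches

img = np.zeros((224, 224, 3))
print(patchify(img, 16).shape)  # (196, 768): 14*14 patches, each 16*16*3 values
```

In the full model, each 768-dimensional patch vector is projected by a learned linear layer and combined with position embeddings before entering the Transformer encoder.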
-
Hello,
I am receiving this error:
> "An error occurred: The checkpoint you are trying to load has model type `qwen2_vl` but Transformers does not recognize this architecture. This could be be…
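This error usually means the installed Transformers release predates the `qwen2_vl` architecture. A minimal sketch of the version comparison; the 4.45.0 threshold is my assumption about when Qwen2-VL support landed, so verify it against the release notes:

```python
def parse_version(v):
    # Turn "4.44.2" into (4, 44, 2) for tuple comparison.
    # Pre-release suffixes like ".dev0" are not handled in this sketch.
    return tuple(int(p) for p in v.split(".")[:3])

# Assumption: first transformers release that registers the qwen2_vl architecture.
QWEN2_VL_MIN_VERSION = (4, 45, 0)

def has_qwen2_vl_support(installed_version):
    """Return True if the given transformers version should recognize qwen2_vl."""
    return parse_version(installed_version) >= QWEN2_VL_MIN_VERSION

print(has_qwen2_vl_support("4.44.2"))  # False: upgrade needed
print(has_qwen2_vl_support("4.46.1"))  # True
```

If the check fails for your environment, `pip install -U transformers` is the usual fix.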
-
How can I support a new model in the C++ runtime? Is there any reference document? For example, the multimodal model [llava-one-vision](https://huggingface.co/lmms-lab/llava-onevision-qwen2-7b-ov).
Foll…
-
My server cannot connect to the Hugging Face website, so I manually downloaded the pretrained model used in the code and placed it in the `img2img-turbo-main` folder. After executing the command `pyth…
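When the Hub is unreachable, the usual approach is to point the loading code at the local checkpoint directory instead of a Hub model ID, and/or set `HF_HUB_OFFLINE=1` so `huggingface_hub` makes no network requests. A small sketch of a fallback helper; the directory and Hub ID below are illustrative, not taken from the repo:

```python
import os

# Force huggingface_hub to use only locally cached files (no network requests).
os.environ["HF_HUB_OFFLINE"] = "1"

def resolve_model_source(local_dir, hub_id):
    """Prefer a locally downloaded checkpoint directory over a Hub model ID.
    Returns the local path if it exists and is non-empty, else the Hub ID."""
    if os.path.isdir(local_dir) and os.listdir(local_dir):
        return local_dir
    return hub_id

# Illustrative call; the actual directory layout depends on the repo.
source = resolve_model_source("./img2img-turbo-main/checkpoints", "stabilityai/sd-turbo")
print(source)
```

The returned `source` can then be passed to `from_pretrained(...)`, which accepts a local directory path as well as a Hub ID.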