-
- https://arxiv.org/abs/2107.02192
- 2021
Transformers have been successful in both the language and vision domains.
However, scaling them to long sequences such as long documents or high-resolution images is prohibitively expensive, because the self-attention mechanism has quadratic time and memory complexity in the input sequence length.
In this paper, for both language and vision tasks, long se…
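The quadratic cost the abstract refers to is easy to see in a toy example: the attention score matrix alone is n × n per head. A minimal sketch (sizes are arbitrary, chosen only for illustration):
```
import torch

# Why vanilla self-attention is quadratic in sequence length n:
# the score matrix Q @ K^T has shape (n, n) before the softmax, so
# both time and memory grow as n^2.
n, d = 4096, 64
q = torch.randn(n, d)
k = torch.randn(n, d)
scores = q @ k.T / d ** 0.5   # shape (4096, 4096)
print(scores.shape)
```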
-
Hi, how do I get llama-3.2 to work with ipex_llm?
Here's my code.
```
import requests
import torch
from PIL import Image
from transformers import MllamaForConditionalGeneration, AutoProcessor
imp…
```
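For reference, the usual ipex_llm path is to load the Hugging Face model first and then pass it through `ipex_llm.optimize_model`. A minimal sketch, assuming an Intel GPU (`xpu`) target and the `meta-llama/Llama-3.2-11B-Vision-Instruct` checkpoint; how completely `optimize_model` covers Mllama's vision tower may depend on your ipex_llm version:
```
import torch
from transformers import MllamaForConditionalGeneration, AutoProcessor
from ipex_llm import optimize_model

# Assumed checkpoint; substitute whichever Llama 3.2 vision model you use.
model_id = "meta-llama/Llama-3.2-11B-Vision-Instruct"

# Load on CPU first, then let ipex_llm rewrite the module for low-bit inference.
model = MllamaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.bfloat16
)
model = optimize_model(model, low_bit="sym_int4")
model = model.to("xpu")  # move to the Intel GPU after optimization

processor = AutoProcessor.from_pretrained(model_id)
```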
-
I am preparing to reproduce the ChineseClip paper, initializing the image encoder from CLIP-VIT-B/16, downloaded from https://huggingface.co/openai/clip-vit-base-patch16/tree/main. But when loading the model weights, I found that the image encoder parameters fail to load. Printing the checkpoint, I see the corresponding parameter names start with vision_model.encoder.layers.…
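If the failure is only a key-name mismatch, stripping the `vision_model.` prefix from the OpenAI checkpoint's keys before loading usually resolves it. A minimal sketch, assuming the prefix is the sole difference (here the inner module of `CLIPModel` stands in for the ChineseCLIP image tower):
```
from transformers import CLIPModel

# Load the OpenAI checkpoint and keep only the image-tower weights,
# remapping "vision_model.encoder.layers..." -> "encoder.layers...".
clip = CLIPModel.from_pretrained("openai/clip-vit-base-patch16")
prefix = "vision_model."
vision_state = {
    k[len(prefix):]: v
    for k, v in clip.state_dict().items()
    if k.startswith(prefix)
}

# image_encoder stands in for the ChineseCLIP image tower being initialized;
# strict=False tolerates heads that exist on only one side.
image_encoder = clip.vision_model
missing, unexpected = image_encoder.load_state_dict(vision_state, strict=False)
print("missing:", missing, "unexpected:", unexpected)
```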
-
### System Info
transformers version: 4.45.2
python version: 3.9.20
torch version: 2.4.1+cu124
![image](https://github.com/user-attachments/assets/cb141f17-3482-462b-8184-7210f0a6c75e)
### W…
-
```
Traceback (most recent call last):
  File "I:/Code/CC-DETR-main/Networks/ALTGVT1.py", line 596, in <module>
    model = alt_gvt_large(pretrained=True)
  File "I:/Code/CC-DETR-main/Networks/ALTGVT1.py", lin…
```
-
### Feature request
`AutoModel.from_config` does not work with Mllama (`MllamaConfig`, `MllamaVisionConfig`). I would like to request the ability to use Mllama through `AutoModel`.
### Motivation
T…
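Until this is supported upstream, one possible workaround (a sketch, assuming the auto-mapping is simply missing and that `MllamaForConditionalGeneration` is an acceptable target class) is to register the pairing yourself:
```
from transformers import AutoModel, MllamaConfig, MllamaForConditionalGeneration

# Bind the config class to a model class so AutoModel.from_config can
# resolve Mllama like any other registered architecture.
AutoModel.register(MllamaConfig, MllamaForConditionalGeneration)

config = MllamaConfig()  # default-initialized config, just for the demo
model = AutoModel.from_config(config)
```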
-
### Feature request
Add support for LlamaGen, an autoregressive image generation model, to the Transformers library. LlamaGen applies the next-token prediction paradigm of large language models to vi…
-
How can I add or extend an MLP head in the same model for detection? Say the head detects objects A, B, C in an image, and we want to train by adding to or extending the MLP/classification head so it also detects objects D, E…
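One common pattern (a sketch, not tied to any particular detector) is to grow the final linear layer from C to C + new classes and copy the learned rows, so the original classes keep their weights while the new rows train from scratch; `extend_head` below is a hypothetical helper name:
```
import torch
import torch.nn as nn

def extend_head(old_head: nn.Linear, num_new: int) -> nn.Linear:
    """Grow a classification head by num_new classes, keeping learned weights."""
    new_head = nn.Linear(old_head.in_features, old_head.out_features + num_new)
    with torch.no_grad():
        # Copy rows for the original classes (A, B, C); the extra rows for the
        # new classes (D, E, ...) keep their fresh random initialization.
        new_head.weight[: old_head.out_features] = old_head.weight
        new_head.bias[: old_head.out_features] = old_head.bias
    return new_head

# Usage: replace the model's existing 3-class head with a 5-class one.
old_head = nn.Linear(768, 3)
new_head = extend_head(old_head, num_new=2)
```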
-
Hi, I am working with vision transformers, not only the vanilla ViT but also different models, on the UMDAA2 data set. This data set has an image resolution of 128*128; would it be better to transform the im…
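One alternative to resizing the images up to 224*224 is to keep them at 128*128 and interpolate the pretrained position embeddings; the `transformers` ViT forward pass exposes this via `interpolate_pos_encoding=True`. A minimal sketch (the checkpoint name is just an example):
```
import torch
from transformers import ViTModel

# Feed 128x128 inputs to a ViT pretrained at 224x224 by interpolating the
# position embeddings instead of upsampling the images.
model = ViTModel.from_pretrained("google/vit-base-patch16-224-in21k")

pixel_values = torch.randn(1, 3, 128, 128)  # stand-in for a preprocessed batch
outputs = model(pixel_values=pixel_values, interpolate_pos_encoding=True)
print(outputs.last_hidden_state.shape)  # (1, 1 + (128/16)**2, 768) = (1, 65, 768)
```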
-
Hi @rosinality, hope you are doing well!
I really like your repo, especially the dataloader and augmentation parts for image classification.
I do not work mainly in the vision field, but I still have …