-
## Paper link
- [OpenReview](https://openreview.net/forum?id=D78Go4hVcxO)
## Publication date (yyyy/mm/dd)
2021/09/29
## Abstract
## TeX
```
% yyyy/mm/dd
@inproceedings{
park2022how,
title={How Do Vi…
-
Hi, I'm trying to make a CLIP model compatible with neuron-distributed (because I'm going to continue with a multimodal model after it).
Currently in my notebook, inside an inf2.xlarge Ubuntu 22 instance, I have:
…
-
### 🚀 The feature, motivation and pitch
i.e. instead of this:
https://github.com/vllm-project/vllm/blob/main/vllm/entrypoints/openai/serving_chat.py#L138-L140
allow multiple images.
Idea is …
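For illustration, a multi-image chat request might look like the sketch below. The message structure follows the OpenAI vision-style content-parts format; whether vLLM accepts more than one `image_url` part is exactly what this feature request asks for, and the model name is a hypothetical placeholder.

```python
# Sketch of an OpenAI-style chat payload carrying multiple images.
# The exact schema vLLM would accept is an assumption here.
payload = {
    "model": "llava-hf/llava-1.5-7b-hf",  # hypothetical multimodal model name
    "messages": [
        {
            "role": "user",
            "content": [
                {"type": "text", "text": "Compare these two images."},
                {"type": "image_url", "image_url": {"url": "https://example.com/a.png"}},
                {"type": "image_url", "image_url": {"url": "https://example.com/b.png"}},
            ],
        }
    ],
}

# Count the image parts in the user message.
image_parts = [p for p in payload["messages"][0]["content"] if p["type"] == "image_url"]
print(len(image_parts))  # → 2
```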
-
- https://arxiv.org/abs/2103.13413
- 2021
We introduce dense vision transformers, an architecture that leverages vision transformers in place of convolutional networks as a backbone for dense prediction tasks.
Tokens obtained from various stages of the vision transformer are assembled into image-like representations at various resolutions and progressively combined into full-resolution predictions using a convolutional decoder.
The transformer backbo…
e4exp updated
2 years ago
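The token-reassembly step described above can be sketched as follows. This is a minimal illustration under stated assumptions (no class token, a square patch grid), not the paper's DPT implementation:

```python
import numpy as np

def assemble_tokens(tokens, h, w):
    """Reshape a sequence of ViT tokens (N, C) into an image-like
    representation (C, H, W), as a convolutional decoder expects.
    Assumes no class token and N == h * w."""
    n, c = tokens.shape
    assert n == h * w, "token count must match the patch grid"
    return tokens.reshape(h, w, c).transpose(2, 0, 1)

# 196 tokens of dim 768, as from a 14x14 patch grid (e.g. ViT-B/16 on 224px input).
tokens = np.random.rand(196, 768)
feature_map = assemble_tokens(tokens, 14, 14)
print(feature_map.shape)  # → (768, 14, 14)
```

In the paper, such maps from several transformer stages are resized to different resolutions and fused by the convolutional decoder.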
-
Unfortunately, when running any of the example workflows I get the following error:
```bash
Error occurred when executing AniPortrait_Pose_Gen_Video:
Error(s) in loading state_dict for CLIPVisionMod…
-
### Feature request
Add support for exporting SigLIP models.
### Motivation
As used by many SOTA VLMs, SigLIP is gaining traction, and supporting it can be step 1 toward supporting many VLMs.
### Your …
-
Kindly guide me on how to resolve this issue:
loaded_model = VisionEncoderDecoderModel.from_pretrained('/content/drive/MyDrive/ocr_pth/checkpoint-5000')
processor = TrOCRProcessor.from_pretrained("/conte…
-
MMDetection includes both Swin and DETR; if I understand the concept correctly, both could be fine-tuned with LoRA in a fast and memory-efficient manner.
Support for training with LoRA in object d…
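As a rough sketch of what LoRA does to a single weight matrix (a generic illustration, not MMDetection's API): the frozen pretrained weight `W` is augmented with a trainable low-rank update `B @ A`, scaled by `alpha / r`, so only the small `A` and `B` matrices need gradients and optimizer state.

```python
import numpy as np

def lora_forward(x, W, A, B, alpha=16, r=4):
    """Linear layer with a LoRA update: x @ (W + (alpha/r) * B @ A).T
    W (out x in) is frozen; only A (r x in) and B (out x r) are trained."""
    delta = (alpha / r) * (B @ A)
    return x @ (W + delta).T

rng = np.random.default_rng(0)
d_in, d_out, r = 8, 6, 4
W = rng.standard_normal((d_out, d_in))    # frozen pretrained weight
A = rng.standard_normal((r, d_in)) * 0.01
B = np.zeros((d_out, r))                  # B starts at zero, so the update starts at 0
x = rng.standard_normal((2, d_in))

# With B zeroed, the LoRA layer initially matches the frozen layer exactly.
out = lora_forward(x, W, A, B, r=r)
print(np.allclose(out, x @ W.T))  # → True
```

Zero-initializing `B` is the standard LoRA choice: fine-tuning starts from exactly the pretrained behavior and only the low-rank delta is learned.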
-
### Model description
[jinaai/jina-clip-v1](https://huggingface.co/jinaai/jina-clip-v1/tree/main/onnx)
### Prerequisites
- [X] The model is supported in Transformers (i.e., listed [here](https://hu…
do-me updated
1 month ago