-
- https://arxiv.org/abs/2103.13023
- 2021
Can Vision Transformer (ViT) pre-training be completed without natural images and human-annotated labels?
ViT pre-training appears to depend heavily on large-scale datasets and human-annotated labels, but recent large-scale datasets raise issues such as privacy violations, insufficient fairness protections, and labor-intensive annotation…
e4exp updated
3 years ago
-
## Paper link
- [arXiv](https://arxiv.org/abs/2108.08810)
## Publication date (yyyy/mm/dd)
2021/08/19
Google Research Brain Team
## Overview
## TeX
```
% yyyy/mm/dd
@article{
raghu2021do,
title={Do …
```
-
### Feature request
The support is [already present in huggingface/transformers](https://github.com/huggingface/transformers/pull/27662).
But when I try to export the LLaVA model to Neuron format, i…
lifo9 updated
1 month ago
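For context, a minimal sketch of what the transformers-side LLaVA support referenced in PR #27662 looks like in use; the checkpoint, prompt template, and image path below are illustrative assumptions rather than details from the issue, and a Neuron export would still need to trace/compile this graph.

```python
# Minimal sketch of running LLaVA through transformers (the support referenced above);
# model id, prompt template, and image path are illustrative assumptions.
import torch
from PIL import Image
from transformers import AutoProcessor, LlavaForConditionalGeneration

model_id = "llava-hf/llava-1.5-7b-hf"  # hypothetical checkpoint choice
processor = AutoProcessor.from_pretrained(model_id)
model = LlavaForConditionalGeneration.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

image = Image.open("example.jpg")  # stand-in for a real input image
prompt = "USER: <image>\nWhat is shown in this picture? ASSISTANT:"
inputs = processor(text=prompt, images=image, return_tensors="pt").to(model.device, torch.float16)

# Plain eager PyTorch inference; an AWS Neuron export would have to compile
# this generate loop instead, which is what the feature request is about.
output_ids = model.generate(**inputs, max_new_tokens=64)
print(processor.batch_decode(output_ids, skip_special_tokens=True)[0])
```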
-
Hi there, thanks for merging #282!
I was wondering if you could release the script you used for visualizing the attention maps in the [VISION TRANSFORMERS NEED REGISTERS](https://arxiv.org/pdf/230…
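Pending the authors' script, here is a rough sketch of one common way such maps are produced: hook the last block's qkv projection of a timm ViT, recompute the attention weights, and plot the [CLS]-to-patch attention. The model name, the 14x14 grid size, and the plotting details are assumptions, not the script used in the paper or this repo.

```python
# Rough sketch of visualizing [CLS]->patch attention from a timm ViT's last block.
# NOT the authors' script; model choice, grid size, and plotting are assumptions.
import torch
import timm
import matplotlib.pyplot as plt

model = timm.create_model("vit_base_patch16_224", pretrained=True).eval()

captured = {}
def grab_qkv(module, inputs, output):
    captured["qkv"] = output  # (B, N, 3 * dim)

model.blocks[-1].attn.qkv.register_forward_hook(grab_qkv)

x = torch.randn(1, 3, 224, 224)  # stand-in for a preprocessed image tensor
with torch.no_grad():
    model(x)

attn_mod = model.blocks[-1].attn
B, N, _ = captured["qkv"].shape
qkv = captured["qkv"].reshape(B, N, 3, attn_mod.num_heads, -1).permute(2, 0, 3, 1, 4)
q, k = qkv[0], qkv[1]                                  # (B, heads, N, head_dim)
attn = ((q @ k.transpose(-2, -1)) * attn_mod.scale).softmax(dim=-1)

# [CLS] attention over the 196 patch tokens, averaged across heads (no register tokens assumed).
cls_attn = attn[0, :, 0, 1:].mean(0).reshape(14, 14)
plt.imshow(cls_attn.numpy(), cmap="viridis")
plt.savefig("attn_map.png")
```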
-
## TL;DR
- ViT feature representations are *less hierarchical* than those of CNNs.
- Early transformer blocks learn both local and global dependencies, given a large enough dataset.
- Skip connections play much more i…
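These layer-wise comparisons are usually made with centered kernel alignment (CKA) between activations of different blocks; a minimal linear-CKA sketch is below. It is a generic implementation of the standard formula, not the paper's exact evaluation protocol.

```python
# Minimal linear CKA between two activation matrices of shape (n_examples, n_features).
# Generic sketch of the standard formula; not the paper's exact protocol.
import torch

def linear_cka(x: torch.Tensor, y: torch.Tensor) -> torch.Tensor:
    x = x - x.mean(dim=0, keepdim=True)   # center features over the example dimension
    y = y - y.mean(dim=0, keepdim=True)
    # CKA(X, Y) = ||Y^T X||_F^2 / (||X^T X||_F * ||Y^T Y||_F)
    return (y.T @ x).norm() ** 2 / ((x.T @ x).norm() * (y.T @ y).norm())

# Toy usage: compare activations from two hypothetical ViT blocks on the same batch.
feats_block2 = torch.randn(512, 768)
feats_block10 = torch.randn(512, 768)
print(float(linear_cka(feats_block2, feats_block10)))
```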
-
Total VRAM 6144 MB, total RAM 32509 MB
Traceback (most recent call last):
File "F:\ai\ComfyUI-master\comfy\model_management.py", line 218, in
import accelerate
File "C:\Users\Ryan\AppData…
-
I am seeing the pytorch warnings "copying from a non-meta parameter in the checkpoint to a meta parameter in the current model, which is a no-op" when loading the CLIP vision tower of a LLaVA model (i…
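That warning text comes from PyTorch's `load_state_dict` when the destination parameters are still on the meta device. Below is a standalone repro, independent of LLaVA/CLIP (so only an illustration of the mechanism, not necessarily the root cause in this issue), plus the usual `assign=True` fix.

```python
# Standalone illustration of the warning (PyTorch >= 2.1), unrelated to LLaVA/CLIP:
# copying real checkpoint tensors into meta parameters is a no-op unless assign=True.
import torch
import torch.nn as nn

state_dict = nn.Linear(4, 4).state_dict()   # real tensors standing in for a checkpoint

with torch.device("meta"):
    model = nn.Linear(4, 4)                 # parameters allocated on the meta device

model.load_state_dict(state_dict)           # emits the "no-op" warning quoted above
print(model.weight.is_meta)                 # True: nothing was actually copied

model.load_state_dict(state_dict, assign=True)  # assigns the checkpoint tensors instead
print(model.weight.is_meta)                 # False: the weights now hold real data
```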
-
timm v1.0.3 was just released 2 hours ago (https://github.com/huggingface/pytorch-image-models/releases/tag/v1.0.3) and it seems like they've reworked the API for `forward_intermediates()` and it retu…
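For reference, my reading of the reworked timm v1.0.x API is sketched below; the keyword arguments (`indices`, `intermediates_only`) and return format are taken from the release notes and may not match the final interface exactly.

```python
# Sketch of timm's forward_intermediates() as of v1.0.x; kwargs and return shapes
# reflect my reading of the release notes, not a verified reference.
import torch
import timm

model = timm.create_model("vit_base_patch16_224", pretrained=False).eval()
x = torch.randn(1, 3, 224, 224)

with torch.no_grad():
    # Default: (final_features, list_of_intermediate_feature_maps).
    final, intermediates = model.forward_intermediates(x)
    # Only selected blocks, skipping the final output.
    feats = model.forward_intermediates(x, indices=[3, 7, 11], intermediates_only=True)

for f in feats:
    print(f.shape)  # e.g. torch.Size([1, 768, 14, 14]) in the default NCHW format
```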
-
I tried to run the demo on multiple RTX 3090s but got strange errors:
```
python3.10/site-packages/transformers/cache_utils.py", line 146, in update
self.key_cache[layer_idx] = torch.cat([self.k…
```
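The traceback is cut off inside the KV-cache update, so the following is only a guess at the failure mode: with model layers spread across several GPUs, `torch.cat` will refuse to concatenate cached and new key states that ended up on different devices.

```python
# Speculative illustration only (the real error above is truncated): torch.cat in the
# KV-cache update fails if cached keys and new keys live on different GPUs.
import torch

cached_keys = torch.randn(1, 8, 10, 64, device="cuda:0")
new_keys = torch.randn(1, 8, 1, 64, device="cuda:1")   # requires a second GPU

try:
    torch.cat([cached_keys, new_keys], dim=-2)
except RuntimeError as e:
    print(e)  # complains that the tensors are on different devices
```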
-
Support for the LLaVA multimodal model would be huge for AWS Neuron chips.
https://huggingface.co/llava-hf/llava-v1.6-mistral-7b-hf
This checkpoint in particular is trending.
I'm not sure if this is the correct…