-
### 🚀 The feature
As the title suggests, add the data augmentation from https://arxiv.org/abs/2204.07118
### Motivation, pitch
This seems to be a simple recipe with good results, and the DeiT family …
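For reference, the augmentation in that paper ("3-Augment") boils down to picking one of grayscale, solarization, or Gaussian blur per image, on top of color jitter and a horizontal flip. A minimal torchvision sketch; the crop settings, jitter strength, and blur kernel below are my assumptions, not the paper's exact values:

```python
# Rough sketch of DeiT III's "3-Augment" recipe (hyperparameters assumed).
import random
from torchvision import transforms

class ThreeAugment:
    """Apply exactly one of: grayscale, solarization, Gaussian blur."""
    def __init__(self):
        self.choices = [
            transforms.Grayscale(num_output_channels=3),
            transforms.RandomSolarize(threshold=128, p=1.0),
            transforms.GaussianBlur(kernel_size=9),  # kernel size assumed
        ]

    def __call__(self, img):
        return random.choice(self.choices)(img)

train_transform = transforms.Compose([
    transforms.RandomResizedCrop(224),
    transforms.RandomHorizontalFlip(),
    ThreeAugment(),
    transforms.ColorJitter(0.3, 0.3, 0.3),  # jitter strength assumed
    transforms.ToTensor(),
])
```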
-
Hi,
I want to use the cosub training method for a custom image classification task, but I can't find its implementation in the code. Does this repo contain the details of the cosub training method? If yes, my com…
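From the paper's description, cosub samples two submodels from the same network via stochastic depth and trains each on a mix of the label loss and a distillation loss toward the other's detached prediction. A rough sketch of how I imagine that loss; the lambda value, the BCE formulation, and the two-forward-pass interface are my assumptions, not the official code:

```python
# Hedged sketch of a cosub-style loss (details assumed from the paper).
import torch
import torch.nn.functional as F

def cosub_loss(model, images, targets_onehot, lam=0.5):
    # Two forward passes: stochastic depth drops a different random
    # subset of residual blocks each time, yielding two "submodels".
    logits1 = model(images)
    logits2 = model(images)

    def submodel_loss(logits, peer_logits):
        # Supervised term plus distillation toward the peer's detached
        # soft prediction; targets_onehot is a float one-hot target.
        sup = F.binary_cross_entropy_with_logits(logits, targets_onehot)
        dist = F.binary_cross_entropy_with_logits(
            logits, torch.sigmoid(peer_logits).detach())
        return (1 - lam) * sup + lam * dist

    return submodel_loss(logits1, logits2) + submodel_loss(logits2, logits1)
```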
-
I cannot find it in the paper. Is it the same as training on ImageNet?
-
With regards to this sentence:
> In Table 5, we show that the Top-1 accuracy on the validation set of ImageNet-1k improves by more than +2% when the backbone is fine-tuned.
What was the learning…
-
- https://arxiv.org/abs/2104.12533
- 2021
Last year saw rapid progress in applying Transformer modules to vision problems.
While some researchers have demonstrated that Transformer-based models have a superior ability to fit data, there is growing evidence that these models overfit, especially when training data is limited…
-
Hi, @lucidrains !
There was promising research published this month (vs. RoPE-mixed (#25) in March): the so-called LieRE positional encodings generalize the kv-vector rotation to any number of d…
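A rough sketch of the mechanism as I read the paper: positions are mapped by a learned linear layer to skew-symmetric generators, the matrix exponential turns each generator into a rotation, and queries/keys are rotated before the dot product. All names and shapes below are illustrative, not from an official implementation:

```python
# Hedged sketch of a LieRE-style positional encoding (shapes assumed).
import torch
import torch.nn as nn

class LieRE(nn.Module):
    def __init__(self, pos_dim: int, head_dim: int):
        super().__init__()
        # Learned linear map from position coordinates to the upper
        # triangle of a head_dim x head_dim skew-symmetric matrix.
        n_upper = head_dim * (head_dim - 1) // 2
        self.proj = nn.Linear(pos_dim, n_upper)
        self.head_dim = head_dim
        self.register_buffer(
            "idx", torch.triu_indices(head_dim, head_dim, offset=1))

    def forward(self, x, positions):
        # x: (..., seq, head_dim), positions: (seq, pos_dim)
        upper = self.proj(positions)                      # (seq, n_upper)
        A = x.new_zeros(positions.shape[0], self.head_dim, self.head_dim)
        A[:, self.idx[0], self.idx[1]] = upper
        A = A - A.transpose(-1, -2)                       # skew-symmetric
        R = torch.matrix_exp(A)                           # rotation matrices
        return torch.einsum('sij,...sj->...si', R, x)
```

In attention, the same module would be applied to both queries and keys before their dot product, so the scores pick up the positional rotations.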
-
Hi and thanks a lot for this codebase!
I'm trying to reproduce your results with a DeiT-Small, but training is very slow, at more than 2 hours per epoch.
I have tried the DeiT commands from the README, …
-
From DeiT III: Revenge of the ViT (https://arxiv.org/pdf/2204.07118.pdf): how long exactly, in hours, does it take to pretrain for 90 epochs on ImageNet-21k with 8 V100 GPUs?
-
Hi
I'm trying to reproduce DeiT-III's ImageNet-21k results.
Could you please share the ViT-B weights pretrained on 21k but not fine-tuned on 1k?
It would be a great help to check and validate m…
-
Hi Hugo,
I noticed that the suggested command for pretraining DeiT III on IN-21k sets --nb_classes to 1000, which is smaller than the number of classes in IN-21k.
Is this a typo?
Best,
Blakey