Open sctrueew opened 3 years ago
@zpmmehrdad I think the most users can't use good Deit models, because they are private and it require significant hardware to: train+fine-tune+distill it.
It seems in public there is only model without Fine-tuning-384x384 and without Distillation with 81.8% Top1: https://github.com/facebookresearch/deit#model-zoo
In the paper, there are models:
There are used data-efficient image Transformers & Distillation:
RegNetX-16GF
- 82.9 Top1 https://github.com/rwightman/pytorch-image-models/blob/2ed8f247154870be7acc1908fde0a7f457f67456/timm/models/regnet.py#L396-L399@AlexeyAB Hi,
Thanks for the explanation.
Hi @AlexeyAB,
Can we have this DeiT?
Thanks