WZMIAOMIAO / deep-learning-for-image-processing

deep learning for image processing including classification and object-detection etc.
GNU General Public License v3.0
23.25k stars, 8.01k forks

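Question about the ViT model: how were the weights on Baidu Netdisk obtained? Retrained from scratch, or converted from the official implementation's npz weights? #807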

Closed ShihuaiXu closed 5 months ago

ShihuaiXu commented 6 months ago

    def vit_base_patch16_224(num_classes: int = 1000):
        """
        ViT-Base model (ViT-B/16) from original paper (https://arxiv.org/abs/2010.11929).
        ImageNet-1k weights @ 224x224, source https://github.com/google-research/vision_transformer.
        weights ported from official Google JAX impl:
        Link: https://pan.baidu.com/s/1zqb08naP0RPqqfSXfkB2EA  Password: eu9f
        """
        model = VisionTransformer(img_size=224,
                                  patch_size=16,
                                  embed_dim=768,
                                  depth=12,
                                  num_heads=12,
                                  representation_size=None,
                                  num_classes=num_classes)
        return model

For the weights on Baidu Netdisk, what does "weights ported from official Google JAX impl" mean? Were they converted directly from the official npz model, or did you retrain the model yourself to obtain them?

WZMIAOMIAO commented 5 months ago

They were converted from the official npz weights.
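For readers wondering what "converted from npz" involves, here is a rough sketch of the general idea, not the script actually used for the Baidu Netdisk weights. It relies on two layout facts: Flax dense kernels are stored as (in, out) while PyTorch `nn.Linear` weights are (out, in), and Flax conv kernels are stored as (H, W, in, out) while PyTorch `nn.Conv2d` weights are (out, in, H, W). The key names on both sides (`embedding/kernel`, `head/kernel`, `patch_embed.proj.weight`, `head.weight`) and the helper names are assumptions for illustration. The attention projections, which are stored per head and need extra reshaping, are deliberately left out. A dict of zero arrays stands in for what `np.load` would return on a real checkpoint.

```python
import numpy as np


def jax_kernel_to_torch_layout(kernel: np.ndarray) -> np.ndarray:
    """Reorder a Flax-style kernel into the layout PyTorch layers expect."""
    if kernel.ndim == 4:
        # Conv kernel: (H, W, in, out) -> (out, in, H, W)
        return kernel.transpose(3, 2, 0, 1)
    if kernel.ndim == 2:
        # Dense kernel: (in, out) -> (out, in)
        return kernel.T
    # Per-head attention kernels need extra reshaping, omitted in this sketch.
    raise ValueError(f"unsupported kernel rank: {kernel.ndim}")


def convert_checkpoint(jax_weights, name_map):
    """Rename entries and fix kernel layouts; returns a dict of NumPy arrays."""
    converted = {}
    for jax_name, torch_name in name_map.items():
        value = np.asarray(jax_weights[jax_name])
        if jax_name.endswith("/kernel"):
            value = jax_kernel_to_torch_layout(value)
        converted[torch_name] = np.ascontiguousarray(value)
    return converted


# Synthetic stand-in for the mapping np.load(...) returns on a real .npz file.
fake_jax_weights = {
    "embedding/kernel": np.zeros((16, 16, 3, 768), dtype=np.float32),
    "embedding/bias": np.zeros((768,), dtype=np.float32),
    "head/kernel": np.zeros((768, 1000), dtype=np.float32),
    "head/bias": np.zeros((1000,), dtype=np.float32),
}

# Assumed name mapping (JAX key -> PyTorch state_dict key), illustration only.
name_map = {
    "embedding/kernel": "patch_embed.proj.weight",
    "embedding/bias": "patch_embed.proj.bias",
    "head/kernel": "head.weight",
    "head/bias": "head.bias",
}

state = convert_checkpoint(fake_jax_weights, name_map)
for key, value in state.items():
    print(key, value.shape)
```

Each array could then be wrapped with `torch.from_numpy` and passed to `model.load_state_dict`, once every parameter of the model has been mapped this way.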