Closed HuFY-dev closed 1 year ago
Hi @HuFY-dev, since the model was trained with PyTorch DistributedDataParallel
(DDP) it implicitly wraps the model in extra module.
parameter. Just putting that line before your data loading is the correct way to load the model without DDP.
Thank you!
Firstly, thanks for the amazing work! I implemented my own code loading your pre-trained model,
IN1K-vit.h.16-448px-300e.pth
, and encountered this issue:I used the exact same model architecture in your
vision_transformer.py
file, and the problem was solved after I added this line before loading:I wonder if there are any issues in your released model weights. Did you forget to update to the newest version?