jeonsworld / ViT-pytorch

PyTorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
MIT License
1.95k stars 374 forks

reconstruction task #59

Open WenBingo opened 1 year ago

WenBingo commented 1 year ago

I would like to ask whether there is model code for the reconstruction task. If not, does the author have any plans to support the reconstruction task?