jeonsworld / ViT-pytorch

Pytorch reimplementation of the Vision Transformer (An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale)
MIT License
1.95k stars 374 forks source link

[ Softmax() missing ] #41

Closed IemProg closed 3 years ago

IemProg commented 3 years ago

Thanks for sharing the ViT implementation, wonderful work.

I'm wondering why you are not using the Softmax() function after head component -from features to classes-, as you did in the Jupyter notebook example?

Thanks