johko / computer-vision-course

This repo is the homebase of a community driven course on Computer Vision with Neural Networks. Feel free to join us on the Hugging Face discord: hf.co/join/discord
MIT License
456 stars 142 forks source link

Visualization (the difference between convolution and transformer) #35

Closed hwaseem04 closed 12 months ago

hwaseem04 commented 12 months ago

Went through your proposed curriculum, and it is really amazing.

Just my suggestion, you can also look into this recent interpretation for ViT. For example: CVPR-2023

When it comes to ViT, the intermediate interpretations are not well explored as the field is emerging, it would be really helpful to the community if you can add the above part in your curriculum