microsoft / computervision-recipes

Best Practices, code samples, and documentation for Computer Vision.
MIT License
9.34k stars 1.16k forks source link

[FEATURE_REQUEST] Add vision transformers model to image classification #678

Open kbjiang opened 1 year ago

kbjiang commented 1 year ago

Description

The transformer-based image classification model is becoming popular. It will be nice to include it in this repo.

Expected behavior with the suggested feature

Other Comments

ShreyasJoshi7 commented 4 months ago

@kbjiang I have created a PR which demonstrates the inference of transformer-based image classification using DeiT. I would appreciate if you could have a look at it. Let me know if you need more things to be added to it. This is the link to the PR Cheers!