cmhungsteve / Awesome-Transformer-Attention

A comprehensive paper list of Vision Transformer/Attention, including papers, code, and related websites

Ultimate-Awesome-Transformer-Attention

This repo contains a comprehensive paper list of Vision Transformer & Attention, including papers, code, and related websites.
This list is maintained by Min-Hung Chen. (Actively updated)

If you find any missing papers, feel free to create a pull request, open an issue, or email me.
Contributions in any form to make this list more comprehensive are welcome.

If you find this repository useful, please consider citing and ★starring this list.
Feel free to share this list with others!

[Update: January, 2024] Added all the related papers from NeurIPS 2023!
[Update: December, 2023] Added all the related papers from ICCV 2023!
[Update: September, 2023] Split the multi-modal paper list to README_multimodal.md
[Update: June, 2023] Added all the related papers from ICML 2023!
[Update: June, 2023] Added all the related papers from CVPR 2023!
[Update: February, 2023] Added all the related papers from ICLR 2023!
[Update: December, 2022] Added attention-free papers from Networks Beyond Attention (GitHub), maintained by Jianwei Yang
[Update: November, 2022] Added all the related papers from NeurIPS 2022!
[Update: October, 2022] Split the 2nd half of the paper list to README_2.md
[Update: October, 2022] Added all the related papers from ECCV 2022!
[Update: September, 2022] Added the Transformer tutorial slides made by Lucas Beyer!
[Update: June, 2022] Added all the related papers from CVPR 2022!


Overview

------ (The following papers have been moved to README_multimodal.md) ------

------ (The following papers have been moved to README_2.md) ------


Citation

If you find this repository useful, please consider citing this list:

@misc{chen2022transformerpaperlist,
    title = {Ultimate awesome paper list: transformer and attention},
    author = {Chen, Min-Hung},
    journal = {GitHub repository},
    url = {https://github.com/cmhungsteve/Awesome-Transformer-Attention},
    year = {2022},
}

Survey

[Back to Overview]

Image Classification / Backbone

Replace Conv w/ Attention

Pure Attention

[Back to Overview]

Vision Transformer

General Vision Transformer

[Back to Overview]

Attention-Free

MLP-Series

[Back to Overview]

Analysis for Transformer

[Back to Overview]

Detection

Object Detection

[Back to Overview]

3D Object Detection

[Back to Overview]

Multi-Modal Detection

[Back to Overview]

HOI Detection

[Back to Overview]

Salient Object Detection

[Back to Overview]

Other Detection Tasks

[Back to Overview]

Segmentation

Semantic Segmentation

[Back to Overview]

Depth Estimation

[Back to Overview]

Object Segmentation

[Back to Overview]

Other Segmentation Tasks

[Back to Overview]

Video (High-level)

Action Recognition

[Back to Overview]

Action Detection/Localization

[Back to Overview]

Action Prediction/Anticipation

[Back to Overview]

Video Object Segmentation

[Back to Overview]

Video Instance Segmentation

[Back to Overview]

Other Video Tasks

[Back to Overview]


References