junxnone / aiwiki

AI Wiki
https://junxnone.github.io/aiwiki
18 stars 2 forks source link

ML Transformer T2TViT #227

Open junxnone opened 3 years ago

junxnone commented 3 years ago

Reference

Brief

如下图, 分析了 ResNet & ViT & T2T-ViT 学习到的 low level feature

image

Arch

image

Tokens to Token / T2T Module

Steps Formula Description
Re-structurization image
image
Soft Split Image : (kernel, strde, padding) = (7, 4, 2)
Tokens : (kernel, strde, padding) = (3, 2, 1)

image

Backbone

junxnone commented 3 years ago

junxnone/aiwiki#226