raoyongming / DynamicViT

[NeurIPS 2021] [T-PAMI] DynamicViT: Efficient Vision Transformers with Dynamic Token Sparsification

https://dynamicvit.ivg-research.xyz/

MIT License

576 stars, 72 forks

issues

Question about the keep/drop token

#48 King4819 opened 6 months ago
0
About BP problem mentioned in the introduction

#47 Cooperx521 opened 7 months ago
2
About BP problem

#46 Cooperx521 closed 7 months ago
0
Token keep ratio setting

#45 King4819 opened 7 months ago
0
RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cuda:1! (when checking argument for argument index in method wrapper_scatter__value)

#44 King4819 closed 8 months ago
2
Gradient accumulation code implementation

#43 King4819 opened 10 months ago
0
DeiT lr question

#42 King4819 closed 11 months ago
1
Reproduce the result on single gpu

#41 King4819 closed 11 months ago
6
Flops after discarding tokens

#40 King4819 closed 1 year ago
1
ViT on CIFAR-100

#39 King4819 closed 1 year ago
7
Dynamic token on training speedup

#38 ZK-Zhou closed 1 year ago
1
About some innovative ideas

#37 vegetableclean opened 1 year ago
0
．

#36 vegetableclean closed 1 year ago
0
Regarding connection elimination of the dropped patches.

#35 Alihjt closed 1 year ago
3
MMdetection version

#34 Rkyzzy opened 1 year ago
0
A few questions about the DeiT model used in DynamicViT

#33 secretu closed 1 year ago
1
The problem of giving up a fixed number of tokens during the inference stage.

#32 MooresS closed 1 year ago
1
Reproducing throughput results

#31 ivanke1 closed 1 year ago
1
Any hints on Batch=1 inference?

#30 lixinghe1999 closed 1 year ago
2
Code for Table 1 in the paper

#29 xXuHaiyang closed 1 year ago
0
Question about x1 and x2 in the forward function of class "AdaSwinTransformerBlock"

#28 LucasZhan closed 1 year ago
3
Fail to reproduce accuracy of DynamicViT-B/0.7: lower accuracy than reported

#27 ShiFengyuan1999 closed 1 year ago
8
Temperature in Gumbel Softmax

#26 kaikai23 closed 1 year ago
1
[Questions] Dyswin feature output shape

#25 zafirshi closed 2 years ago
2
Is the subscript 'i' of Z_global in equation 4 a mis-type?

#24 LucasZhan closed 2 years ago
2
Implementation details are so largely different from the paper description

#23 ming1993li closed 2 years ago
4
Some questions about your code

#22 leoozy closed 2 years ago
2
Have you tested your latency on GPU?

#21 leoozy closed 2 years ago
1
update

#20 liuzuyan closed 2 years ago
0
Flops tools

#19 waynelrs closed 2 years ago
2
DynamicVIT training stored checkpoint

#18 SwapnilDreams100 closed 2 years ago
2
Can't reproduce the accuracy of pre-trained models

#17 xiyiyia closed 2 years ago
2
Question about details of the paper

#16 wangning7149 closed 2 years ago
1
About the token pruning strategy during training and testing

#15 1171000410 closed 2 years ago
6
GFLOPs and Throughput

#14 AmeenAli closed 2 years ago
2
[ LV-ViT-S Pretrained Model ]

#13 IemProg closed 2 years ago
1
About distill

#12 hegc closed 2 years ago
2
Attention mask computation during training

#11 mtchiu2 closed 2 years ago
2
About the residual path in the block

#10 bestfleer closed 2 years ago
1
FLOPs

#9 Cydia2018 closed 2 years ago
2
test-pr

#8 dirtycomputer closed 2 years ago
1
pretrained model download

#7 dirtycomputer closed 2 years ago
2
Loss is nan when training my own dataset

#6 InfinityBox closed 2 years ago
2
Structural downsampling and static token sparsification

#5 Yeez-lee opened 3 years ago
3
Pretrain LV-ViT-S and LV-ViT-M

#4 kristenkn closed 3 years ago
1
Question about the novelty of Attention Masking

#3 xmu-xiaoma666 closed 3 years ago
3
update visualization example

#2 wl-zhao closed 3 years ago
0
visualization

#1 wl-zhao closed 3 years ago
0