Closed tqosu closed 4 weeks ago
VisionTransformerCP
is the rewrite of the official VisionTransformer used in VideoMAE, but supports activation checkpointing (with_cp=True
).
VisionTransformerLadder
is the side-tuning architecture of VideoMAE, which is introduced in AdaTAD Figure 5.
Thanks.
Hi Shuming,
What are VisionTransformerCP and VisionTransformerLadder?
Thanks.