Open alexcbb opened 6 months ago
One of the most important part of the model is the spatio-temporal transformer layer used in the model. It is a memory efficient version from ViT
This layer combines the idea from the original idea with Vision Transformer.
Feature details
One of the most important part of the model is the spatio-temporal transformer layer used in the model. It is a memory efficient version from ViT
This layer combines the idea from the original idea with Vision Transformer.
What needs to be done