Sorry, I am not familiar with the ViT arch.
It has been used in both Encoder and Decoder Transformer architecture, which mainly focused on NLP tasks.
If you develop your ViT DNN using huggingface, it is not hard to accelerate, at least part of, your code using Turbo.
Sorry, I am not familiar with the ViT arch. It has been used in both Encoder and Decoder Transformer architecture, which mainly focused on NLP tasks. If you develop your ViT DNN using huggingface, it is not hard to accelerate, at least part of, your code using Turbo.