I f you want to use a pre trained Transformer for the same task, how would you use it instead of LSTM here? For example I want to use a lightweight BERT model, what would ne the changes to the line in the end? Trying to grasp the knowledge of the architecture.
If it's about images, you would need to use Image Transformer, and to answer your question I would need to create a separate tutorial. Can't give you a quick question without trying to do it myself
I f you want to use a pre trained
Transformer
for the same task, how would you use it instead ofLSTM
here? For example I want to use a lightweightBERT
model, what would ne the changes to the line in the end? Trying to grasp the knowledge of the architecture.