Closed tanshuai0219 closed 5 months ago
Thank you both for showing interest in our work! I have added a link to the weights used for the backbone in the downstream task.
Hi there, I've downloaded the task weights and attempted to load the state dictionary into the model, but I encountered some missing keys related to the temporal transformer. A gist of a notebook can be found here Thanks for your attention :)
please set the number of layers to 2 when initialising the model (as described in the paper)
+1