ajtao closed 1 year ago
Thank you! ☺️ Sorry about my late reply! Yes, this ablation variant used only the first transformer layer ("inner"). Please see Eq. (6) and Eq. (7) in the supplementary material for the exact formulation.
Feel free to let me know if you have any follow-up questions!
Best, Honglu
Hello, congrats on the amazing work!
Table 2 of the supplementary material shows that the 1-scale keypoint model achieved 91.2 test accuracy. Could you say a little about which portion of the model this was? Was it a model that used only the first transformer layer ("inner") and not the other three transformer layers (middle, outer, group)?