EricGuo5513 / momask-codes

Official implementation of "MoMask: Generative Masked Modeling of 3D Human Motions (CVPR2024)"
https://ericguo5513.github.io/momask/
MIT License
810 stars 70 forks source link

Forward Pass Residual Transformer #85

Closed wngTn closed 2 hours ago

wngTn commented 2 hours ago

Greetings,

With the residual transformer, you want to predict the residual layer tokens. So I am a little bit confused as to why you take the tokens of the first $n - 1$ layers here. That is, you take the base layer + 4 residual layers. Shouldn't you take all_indices[..., 1:] to get the residual tokens?

wngTn commented 2 hours ago

Ah nvm, to predict the $n$-th layer, it needs to take the $n-1$ previous layers as input.