is the randomization of tokens in training intended?

exitudio / MMM

Official repository for "MMM: Generative Masked Motion Model"

https://exitudio.github.io/MMM-page/


is the randomization of tokens in training intended? #14

WilsonWangTHU closed this issue 4 months ago

WilsonWangTHU commented 4 months ago

Really like this work!

I noticed that during training, 50% of the tokens are randomly reassigned to another code before the masking process is applied.

Is this intended? And I wonder what the intuition for that is.

exitudio commented 4 months ago

Thank you for your interest.

During inference, the model predicts masked tokens iteratively. If it makes a wrong prediction in an early iteration, that error can negatively impact predictions in later iterations. To minimize this accumulated error, we add noise during training by replacing some input tokens with random tokens. This helps the model stay robust to errors that might occur during inference.
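For readers following along, the idea can be sketched roughly as below. This is a minimal illustration in plain Python, not the code from the repository: the function name `corrupt_then_mask`, the parameters `replace_prob` and `mask_ratio`, and the use of a single fixed mask ratio are all assumptions made for the example. The default `replace_prob=0.5` simply mirrors the 50% figure quoted in the question.

```python
import random


def corrupt_then_mask(tokens, codebook_size, mask_id,
                      replace_prob=0.5, mask_ratio=0.5, rng=None):
    """Hypothetical helper: add token noise, then mask, for one sequence.

    Returns (model_input, targets, is_masked).
    """
    rng = rng or random.Random()
    n = len(tokens)

    # The loss targets are always the clean, original tokens.
    targets = list(tokens)

    # Step 1 (noise): with probability replace_prob, swap a token for a
    # uniformly random code. The random code may happen to equal the
    # original one, so the effective corruption rate is slightly lower.
    noisy = [
        rng.randrange(codebook_size) if rng.random() < replace_prob else t
        for t in tokens
    ]

    # Step 2 (masking): hide a subset of positions behind mask_id.
    # A fixed ratio is an assumption; a real schedule may vary it per batch.
    n_mask = min(n, max(1, round(mask_ratio * n))) if n else 0
    masked_positions = set(rng.sample(range(n), n_mask))

    model_input = [
        mask_id if i in masked_positions else t
        for i, t in enumerate(noisy)
    ]
    is_masked = [i in masked_positions for i in range(n)]
    return model_input, targets, is_masked


if __name__ == "__main__":
    codes = [3, 1, 4, 1, 5, 9, 2, 6]
    x, y, m = corrupt_then_mask(codes, codebook_size=16, mask_id=16)
    print("input  :", x)
    print("targets:", y)
    print("masked :", m)
```

The point of the sketch is that the unmasked context the model sees is partly wrong, while the targets stay clean, so the model learns not to trust every visible token. That matches the situation at inference time, where visible tokens are earlier predictions that may be incorrect.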

WilsonWangTHU commented 4 months ago

ty! that makes a lot of sense