Input normalization - Githubissues

Hi, thanks for your great work! It is very easy to work with!

I‘m working on using the model for inference and I‘m wondering why was the decision made to normalize with regards to the image dimensions? Wouldn‘t normalizing in reference to the pose make more sense? So for example pelvis is 0,0 and the sequence absolute max determines the scale? As it is, the performance starts to degrade with very small keypoints. What do you think should be best practice for normalizing input keypoints in the wild?

Thanks a lot!

TaatiTeam / MotionAGFormer

Input normalization #51