Closed: AliasChenYi closed this issue 3 months ago.
How do I use GT 2D poses for training? Can you help me?
@AliasChenYi This was mentioned before in #5. What do you mean by real 2D pose estimation training?
I mean the second item in your paper: "We use the Stacked Hourglass 2D pose detection results and 2D ground truths on Human3.6M."
Yeah, the 2D ground truth is explained in #5.
I get it, thank you very much.
I have another question: the configuration file references the data/motion2d/ directory, but I don't actually have that directory, and looking at the rest of the code it seems unused. Could you tell me what it does?
Ohhh, I believe that's leftover from MotionBERT that I forgot to delete. MotionBERT also has some 2D datasets that it uses for pretraining, such as PoseTrack and InstaVariety (see here for details), but I didn't use them. The data_root_2d option in MotionAGFormer is unused.
Understood. So for GT-2D training, we don't need to cut the sequences into clips such as 27 frames, 81 frames, etc.?
For MotionAGFormer-XS and MotionAGFormer-S we do.
What about MotionAGFormer-B and MotionAGFormer-L? I don't know whether you did that there. Do I have to do it again if I train those versions?
Ohh, I mean that for GT-2D training, we don't need to re-run the Stacked Hourglass 2D preprocessing, i.e., commands like python h36m.py --n-frames 243?
We still have to run it, because that preprocessing handles both the Stacked Hourglass 2D detections and the 3D ground truth. And since the 2D ground truth is derived from the 3D ground truth, the preprocessing is still needed.
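For intuition, here is a minimal sketch of how 2D ground truth can be derived from 3D ground truth via pinhole projection. The function name and intrinsics handling are hypothetical; the actual preprocessing in h36m.py may additionally normalize coordinates or correct for lens distortion:

```python
import numpy as np

def project_gt_3d_to_2d(joints_3d_cam, fx, fy, cx, cy):
    """Hypothetical helper: perspective-project camera-space 3D joints
    of shape (T, 17, 3) into 2D pixel coordinates of shape (T, 17, 2)
    using pinhole intrinsics (focal lengths fx, fy; principal point cx, cy)."""
    # Divide by depth (z) to get normalized image-plane coordinates.
    x = joints_3d_cam[..., 0] / joints_3d_cam[..., 2]
    y = joints_3d_cam[..., 1] / joints_3d_cam[..., 2]
    # Scale by focal length and shift by the principal point.
    u = fx * x + cx
    v = fy * y + cy
    return np.stack([u, v], axis=-1)
```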
Ok, thank you very much for your reply, I will implement it right now.
Sorry to bother you again. I would like to ask what train_2d means. Why is it absent in the XS and S configs but present in the B and L versions? Also, my reproduced results are always lower than the paper's. Can you give me some advice?
For the Base version, the paper reports P1 = 38.4 and P2 = 32.6, but I reproduced P1 = 38.9 and P2 = 32.7. The best result was reached at epoch 17; after that, training only got worse and fluctuated unstably. Is this a normal phenomenon?
@AliasChenYi Your first question: it's another MotionBERT hyperparameter that I didn't use and forgot to delete. Forget about it (it is for including a 2D dataset in pretraining, which we don't have).
Second question: from my experiments a year back, I noticed that when I changed the GPU from an A40 to something else, or when I changed the batch size, the final result was slightly worse. I believe you can't replicate exactly the same result unless you have the same environment that I had.
And I forgot to mention: it is OK to have fluctuations after the first few epochs.
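As an aside, below is a minimal sketch of the usual PyTorch determinism knobs; note that even with all of these set, results can still drift across different GPU models and batch sizes, which matches the observation above:

```python
import random
import numpy as np
import torch

# Seed every RNG the training loop might touch.
seed = 0
random.seed(seed)
np.random.seed(seed)
torch.manual_seed(seed)
torch.cuda.manual_seed_all(seed)

# Trade speed for reproducibility in cuDNN kernel selection.
torch.backends.cudnn.deterministic = True
torch.backends.cudnn.benchmark = False
```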
Is it true that all versions achieve their best results over 90 epochs? During training I observed that a good result can already be obtained within the first 20 epochs.
Honestly, I don't remember. Unfortunately I accidentally deleted the training logs, so I'm not sure.
Thank you very much for your reply
Sorry to disturb you again. I would like to ask which part of the code is used to calculate the MACs and MACs/frame metrics?
@AliasChenYi Answered in #16.
For MACs/frame, you can simply divide the total MACs by the number of input frames. The reason I report it this way is that some models (e.g., PoseFormerV2) predict only the center frame and need to run the forward pass F times to produce the same number of outputs.
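For illustration, here is a minimal sketch using the thop package (an assumption on my part; #16 may describe a different profiler), with a stand-in model and an assumed input shape of (1, T, 17, 3):

```python
import torch
from thop import profile  # pip install thop

# Stand-in model for illustration; replace with an actual MotionAGFormer instance.
model = torch.nn.Linear(3, 3)

T = 243                          # number of input frames
x = torch.randn(1, T, 17, 3)     # assumed shape: (batch, frames, joints, channels)

# thop returns total multiply-accumulate operations and the parameter count.
macs, params = profile(model, inputs=(x,))
print(f"Total MACs: {macs:.3e}")
print(f"MACs/frame: {macs / T:.3e}")  # divide by frame count, as described above
```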
Ohh, thank you very much!
Hello, I would like to ask: what needs to be done for real 2D pose estimation training?