ChenFengYe / motion-latent-diffusion

[CVPR 2023] Executing your Commands via Motion Diffusion in Latent Space, a fast and high-quality motion diffusion model
https://chenxin.tech/mld/
MIT License
586 stars 55 forks source link

Results Stage1 KIT Dataset #50

Open AlessioSam opened 1 year ago

AlessioSam commented 1 year ago

Hi, thank you for your work, I find it brilliant and really useful for future developments in this field!

Regarding the HumanML3D dataset, the results are easily replicable. However when trying to train the model on the KIT dataset I am having trouble getting your results, similar to issue #44.

In stage1, what happens is that the model seems not to learn to reconstruct joints and features.

Is it possible to have your checkpoint so I can check that indeed the dataset is not corrupted? Or if you have any other advice (hyperparameters, ...)

After 20k epochs I have: FID: 26.7 R_precision_top_1: 0.199 AVE_mean_pose: 2145 AVE_mean_joints: 79551

In contrast, the metrics for GT are correct.

Looking forward to your reply! Thanks, Alessio sampieri@diag.uniroma1.it

weleen commented 1 year ago

@AlessioSam I have the same problem, have you solved it?

ChenFengYe commented 1 year ago

@weleen @AlessioSam Hi Weleen and AlessioSam, could you provide more details about this implementation? The FID metric shows that this training is broken. Is there anything about loss?

weleen commented 1 year ago

@ChenFengYe

image

This is my test results.