Training loss is relatively large

real-stanford / diffusion_policy

[RSS 2023] Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

https://diffusion-policy.cs.columbia.edu/

MIT License

1.09k stars, 203 forks

Training loss is relatively large #80

Closed. COST-97 closed this issue 1 month ago.

COST-97 commented 1 month ago

Hello, I trained a similar diffusion transformer policy on the CALVIN dataset, but the noise-prediction loss on the actions only dropped from about 1 to about 0.2 and then stopped decreasing. I read in the paper that the loss can go below 0.004. Is my model underfitting? Have you encountered a similar problem?
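For reference, here is a minimal sketch of how the two numbers in the question can be read, assuming the loss is a plain mean squared error between predicted noise and sampled unit-variance Gaussian noise (the usual epsilon-prediction setup; I have not confirmed this matches the training code in question). The batch shape, the helper name `noise_prediction_mse`, and the synthetic "predictor" are all hypothetical and only illustrate the scale of the loss.

```python
import numpy as np


def noise_prediction_mse(pred_noise, true_noise):
    """Mean squared error between predicted and sampled noise."""
    return float(np.mean((pred_noise - true_noise) ** 2))


rng = np.random.default_rng(0)

# Hypothetical batch: 4096 action chunks, horizon 16, 7-D actions.
true_noise = rng.standard_normal((4096, 16, 7))

# A predictor that always outputs zero scores an MSE close to the
# variance of the target noise, i.e. about 1.
baseline = noise_prediction_mse(np.zeros_like(true_noise), true_noise)

# A synthetic predictor whose residual error has variance 0.2 scores
# an MSE of about 0.2, i.e. roughly 20% of the noise variance is unexplained.
residual = np.sqrt(0.2) * rng.standard_normal(true_noise.shape)
plateau = noise_prediction_mse(true_noise + residual, true_noise)

print(f"zero predictor MSE: {baseline:.3f}")
print(f"residual-variance-0.2 predictor MSE: {plateau:.3f}")
```

Under that assumption, a starting loss near 1 is what an uninformative predictor would give, and a plateau near 0.2 means about a fifth of the noise variance is still unexplained on average over the sampled diffusion timesteps. Whether that counts as underfitting probably depends on the dataset, the action normalization, and the noise schedule, so a direct comparison with a number from the paper is only meaningful if those match.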