real-stanford / diffusion_policy

[RSS 2023] Diffusion Policy Visuomotor Policy Learning via Action Diffusion
https://diffusion-policy.cs.columbia.edu/
MIT License
1.09k stars 203 forks source link

Training loss is relatively large #80

Closed COST-97 closed 1 month ago

COST-97 commented 1 month ago

Hello: I used a similar diffusion transformer policy and trained on the calvin dataset, but the noise loss of the action could only drop from about 1 to about 0.2, and could not continue to decrease. I read in the paper that the loss can be lower than 0.004. Is the model I trained underfitting? Have you encountered similar problems?