YanjieZe / 3D-Diffusion-Policy

[RSS 2024] 3D Diffusion Policy: Generalizable Visuomotor Policy Learning via Simple 3D Representations
https://3d-diffusion-policy.github.io
MIT License

Question about reproducing the DP3 algorithm in real-device environment #89

Closed HarveyYesan closed 1 week ago

HarveyYesan commented 1 week ago

We would like to collect data in our own real-robot environment to reproduce the DP3 algorithm. Our setup uses a stationary L515 camera and a RealMan robotic arm with a gripper. After collecting data and starting training, we found that the loss barely decreases. We have the following questions:

1. The point cloud fed into the network is expressed in the camera coordinate system, while the end-effector poses in the actions are expressed in the robot's base coordinate system. Can the policy learn across these two frames?
2. The end effector sometimes moves outside the camera's field of view. Does this have a significant impact?
3. During trajectory collection, the robotic arm sometimes occludes the target object. Will this cause problems?

YanjieZe commented 1 week ago

Hi, thank you for your interest.

  1. It is ok. The point cloud and the actions can be expressed in different (but fixed) frames; if you prefer a single frame, see the sketch after this list.
  2. It might hurt performance. It is better to keep the robot within the camera's field of view.
  3. It is ok.
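
For reference, here is a minimal sketch of expressing the camera-frame point cloud in the robot base frame, assuming you have a calibrated camera-to-base extrinsic (the 4x4 matrix `T_base_cam` below is a placeholder from your own hand-eye calibration, not something provided by this repo):

```python
import numpy as np

def transform_to_base(points_cam: np.ndarray, T_base_cam: np.ndarray) -> np.ndarray:
    """Transform an (N, 3) point cloud from the camera frame to the robot base frame.

    T_base_cam is the 4x4 homogeneous camera-to-base extrinsic from your own
    hand-eye calibration (placeholder here; not provided by this repo).
    """
    # Append a homogeneous coordinate of 1 to every point: (N, 3) -> (N, 4).
    points_h = np.concatenate([points_cam, np.ones((points_cam.shape[0], 1))], axis=1)
    # Apply the rigid transform and drop the homogeneous coordinate.
    return (points_h @ T_base_cam.T)[:, :3]


if __name__ == "__main__":
    # Example extrinsic: identity rotation, camera 0.5 m above the base (made-up values).
    T_base_cam = np.eye(4)
    T_base_cam[:3, 3] = [0.0, 0.0, 0.5]
    cloud_cam = np.random.rand(1024, 3).astype(np.float32)  # fake camera-frame cloud
    cloud_base = transform_to_base(cloud_cam, T_base_cam)
    print(cloud_base.shape)  # (1024, 3)
```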

I think it would be pretty weird if the loss does not decrease at all; we have never observed this in any of our experiments. Even when the policy does not work, the loss should still decrease. Have you successfully run DP3 in simulation? I would suspect there is a problem in your data or setup.
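
As a starting point for debugging the data, a quick sanity check along the lines of the sketch below (file name and keys are hypothetical; adapt them to however your episodes are stored) can catch common issues such as NaNs, wrong units (mm vs. m), or near-constant actions:

```python
import numpy as np

# Hypothetical episode file; replace with your own storage format and keys.
episode = np.load("episode_0000.npz")
actions = episode["action"]        # (T, action_dim), e.g. EE pose + gripper
clouds = episode["point_cloud"]    # (T, N, 3), camera-frame points

# 1. NaNs / infs anywhere will silently break training.
assert np.isfinite(actions).all(), "actions contain NaN/inf"
assert np.isfinite(clouds).all(), "point clouds contain NaN/inf"

# 2. Check units and ranges: EE positions in meters should be O(1), not O(1000).
print("action min/max per dim:\n", actions.min(axis=0), "\n", actions.max(axis=0))
print("point cloud bounds:", clouds.reshape(-1, 3).min(axis=0),
      clouds.reshape(-1, 3).max(axis=0))

# 3. Near-constant actions (std close to 0) mean there is almost nothing to learn.
print("action std per dim:", actions.std(axis=0))

# 4. Consecutive EE positions should change smoothly, not jump.
step = np.linalg.norm(np.diff(actions[:, :3], axis=0), axis=1)
print("max per-step EE translation:", step.max())
```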