Train deep reinforcement learning model for robotics grasping. Choose from different perception layers raw Depth, RGBD and autoencoder. Test the learned models in different scenes and object datasets
The PPO algorithm is difficult to converge,and the gripper always move up and away from the table.Could you please give me some hint about it.Sincerely appreciate it!
The PPO algorithm is difficult to converge,and the gripper always move up and away from the table.Could you please give me some hint about it.Sincerely appreciate it!