Closed RyanLiu112 closed 1 year ago
Sorry for the delay. My thinking would be that for the drawer open task, the arm needs to grip the handle to open the drawer so near object is set up in a way that, in the reward computation, the agent is guided to grip the handle via the distance between the drawer handle and the hand. The button press top down task would not require gripping, and therefore the arm can keep the gripper closed and press the button with the tip(s) of the end effector "fingers." I hope this helps.
Thanks for your response and it is clear. near_object
flag of the button press task should be fixed since it never turns True
using the scripted policy.
Hi,
I use both the scripted policy in the Meta-World and the expert policy trained by SAC to rollout trajectories in the
Button Press
task. But thenear_object
flag in the returnedinfo
never turnsTrue
. When I tryDrawer Open
task, thenear_object
turnsTrue
when the arm approaches the drawer. I find that thenear_object
flag in theDrawer Open
task is computed according the position of thehand
of the arm, but thenear_object
in theButton Press
task is computed based on the position of the center ofleftEndEffector
andrightEndEffector
. What's the difference of these two positions? Is it more proper to use the position of thehand
of the arm to computenear_object
in theButton Press
task?