Open farzadab opened 5 years ago
Video: crazy tip-heel with no residual action + RSI + imitation reward (3D).
[env commit ID: 6cf6754
]
[experiment ID: 2019_06_04__13_35_30__cassie-nomocap
]
It seems like the positive reward of speed, orientation, and CoM are too high, so it can get away with not imitating the joint motion ➜ adjusting the weights.
Bug: when residual_control
is False
, having a DeviationRew
does not make sense.
Video: crouched walk with constant residual - RSI + progress reward (3D)
[env commit ID: be44328
]
[experiment ID: 2019_06_05__17_22_31__cassie-shinspring
]
Does a weird movement with the left leg at the start.
Video: improved the tip-heel by using more imitation reward, less energy use, no deviation reward, and some initial velocity.
[env commit ID: 8781941e
]
[experiment ID: 2019_06_07__15_04_43__cassie-rsi-rew6-speed
]
@ZhaomingXie pointed out a bug: if the policy uses a Tanh
at the end it can't reach all possible positions.
Ideas for debugging after talking with @mvandepanne:
Other ideas: