I try the task of Pendulum-v0. Here is an interesting scene: the moment switches the direction when the stick is at the bottom other than when the stick falls. But, a better solution is that the direction of the moment is always the same as the speed direction of the stick. When the stick falls, the moment should change the direction immediately. Thus, I have a question about the optimal property of this MPC solution.
I try the task of Pendulum-v0. Here is an interesting scene: the moment switches the direction when the stick is at the bottom other than when the stick falls. But, a better solution is that the direction of the moment is always the same as the speed direction of the stick. When the stick falls, the moment should change the direction immediately. Thus, I have a question about the optimal property of this MPC solution.