Open hanzhi0410 opened 1 week ago
In the paper, it's mentioned that the alive reward encourages the robot not to fail. In the code, this is implemented by awarding 1 point for each step the robot takes, which motivates the agent to run for as many steps as possible, essentially encouraging it to stay alive and avoid failure. This is my understanding.
Hello, thank you for your work. I have read your code and found that there is an interesting item in the reward function called _rewards _alive. I am not sure what the specific purpose of doing this is. Can you explain it to me? Looking forward to your reply