Closed chendayuxixi closed 4 months ago
Hi, thanks for your interests.
The reward design are mainly refered to this paper, and you can also find more details in my slides. Specific rewards are defined in task config files, and computed in this function for each environment.
thank you very much!