PaddlePaddle / PARL

A high-performance distributed training framework for Reinforcement Learning
https://parl.readthedocs.io/
Apache License 2.0
3.24k stars 819 forks source link

关于paddle奖励设置问题 #912

Closed young-shy closed 2 years ago

young-shy commented 2 years ago

请问paddle库 有算法是对奖励值进行了归一化处理吗?

TomorrowIsAnOtherDay commented 2 years ago

你具体指的是哪个工作(算法)呢?

young-shy commented 2 years ago

抱歉,来晚了。之前没太在意,因为看这里介绍说是paddle库进行了这样的处理,但是看代码并非都是如此,TD3就没有直接对reward就行这样的处理 image

young-shy commented 2 years ago

:)