Closed zcheng19 closed 1 year ago
在PPO-discrete/PPO_discrete.py中的111行,gae = delta + self.gamma self.lamda gae (1.0 - d) 有问题,应该是 gae = gae + self.gamma self.lamda delta (1.0 - d) 吧
在PPO-discrete/PPO_discrete.py中的111行,gae = delta + self.gamma self.lamda gae (1.0 - d) 有问题,应该是 gae = gae + self.gamma self.lamda delta (1.0 - d) 吧