PaddlePaddle / PARL

A high-performance distributed training framework for Reinforcement Learning
https://parl.readthedocs.io/
Apache License 2.0
3.24k stars 819 forks source link

这里第2个log_prob似乎跟第1个log_prob毫无关系 #908

Closed amocken closed 2 years ago

amocken commented 2 years ago

https://github.com/PaddlePaddle/PARL/blob/develop/parl/algorithms/paddle/sac.py 那第一个log_prob的存在意义是啥? image

TomorrowIsAnOtherDay commented 2 years ago

注意下81行的计算符号是【-=】,不是【=】,多了个减号

amocken commented 2 years ago

注意下81行的计算符号是【-=】,不是【=】,多了个减号

不好意思,是我大意了😅