dragen1860 / Deep-Learning-with-TensorFlow-book

深度学习入门开源书,基于TensorFlow 2.0案例实战。Open source Deep Learning book, based on TensorFlow 2.0 framework.
http://www.ai101edu.com
13.2k stars 4.09k forks source link

A3C代码有问题 #214

Open muxixixixi opened 4 years ago

muxixixixi commented 4 years ago

在算优势函数R(s)-V(s)的时候两者维度不匹配,算V之前是加了一维最终是两维,R是一维,后面用这个advantage去乘policy也就有问题了