Dose this code "Policy Gradient/Doom" really work?

simoninithomas / Deep_reinforcement_learning_Course

Implementations from the free course Deep Reinforcement Learning with Tensorflow and PyTorch

http://www.simoninithomas.com/deep-rl-course

3.77k stars 1.22k forks source link

Dose this code "Policy Gradient/Doom" really work? #84

Open andersonhusky opened 2 years ago

andersonhusky commented 2 years ago

I learn Chapter5 and write Policy Gradient into tf 2.0 according to "Policy Gradient/Doom", and I just wonder if this code is really work. Because after a night of training, the agent does'nt look like it can recognize aid kit, and the output probability of my Network is just around 0.29~0.35, is my code wrong?