DeepReinforcementLearning / DeepReinforcementLearningInAction

Code from the Deep Reinforcement Learning in Action book from Manning, Inc
MIT License
734 stars 306 forks source link

Chp 7 - Ch7_DistDQN.ipynb #9

Open ghost opened 4 years ago

ghost commented 4 years ago

Is there an error in the training loop code for playing Atari-Freeway: specifically generating the predictions?

_pred2_batch = dist_dqn(state2_batch.detach(),theta2,aspace=aspace)

Should the state2_batch be state_batch?

outlace commented 4 years ago

I believe the original is correct. Does it not work for you?