There is no optimization of prediction network in run2 function

carpedm20 / NAF-tensorflow

"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow

MIT License

193 stars 59 forks source link

Open sasforce opened 6 years ago

sasforce commented 6 years ago

As the title, in run2 function, should the optimization be add ahead of updating of target network?