carpedm20 / NAF-tensorflow

"Continuous Deep Q-Learning with Model-based Acceleration" in TensorFlow
MIT License
193 stars 59 forks source link

There is no optimization of prediction network in run2 function #10

Open sasforce opened 6 years ago

sasforce commented 6 years ago

As the title, in run2 function, should the optimization be add ahead of updating of target network?