pat-coady / trpo

Trust Region Policy Optimization with TensorFlow and OpenAI Gym
https://learningai.io/projects/2017/07/28/ai-gym-workout.html
MIT License
360 stars, 106 forks

Using train_on_batch #35

Open ryanmaxwell96 opened 4 years ago

ryanmaxwell96 commented 4 years ago

Ok,

I'm sorry again for so many questions, but why does train_on_batch take only one input? According to the Keras documentation, the format is supposed to be

train_on_batch(object, x, y, class_weight = NULL, sample_weight = NULL).

So why does this file's implementation look like

train_on_batch([arrays])?

bernardocortez commented 4 years ago

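train_on_batch executes the call method of the TRPO class. The list of arrays is passed as the single input x. In Keras, y is optional, so no separate target is needed when the model computes its loss inside call.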
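To illustrate why one argument can be enough, here is a minimal pure-Python sketch of the dispatch. It is a toy stand-in, not the real Keras API and not this repo's code: ToyModel, ToySurrogate, and the input names ratios and advantages are all hypothetical, and the real Keras method also runs an optimizer step that is omitted here. The only point is that the whole list arrives in call as a single input and the loss is registered from inside call.

```python
class ToyModel:
    """Toy stand-in for a Keras-style model (NOT the real keras.Model)."""

    def __init__(self):
        self.losses = []

    def add_loss(self, value):
        # Collect loss terms registered during the forward pass.
        self.losses.append(value)

    def call(self, inputs):
        raise NotImplementedError

    def train_on_batch(self, x, y=None):
        # y is optional; this toy ignores targets entirely and relies
        # on losses registered inside call(). No optimizer step is run.
        self.losses = []
        self.call(x)
        return sum(self.losses)


class ToySurrogate(ToyModel):
    def call(self, inputs):
        # One list comes in and is unpacked into named arrays
        # (the names are hypothetical, chosen for illustration).
        ratios, advantages = inputs
        surrogate = sum(r * a for r, a in zip(ratios, advantages)) / len(ratios)
        # Register the negated surrogate as the loss to minimize.
        self.add_loss(-surrogate)
        return ratios


model = ToySurrogate()
# A single argument: a list holding every array the loss needs.
loss = model.train_on_batch([[1.0, 1.0], [0.5, 1.5]])
print(loss)
```

In this sketch everything the loss depends on travels inside x, which is why a call of the form train_on_batch([arrays]) does not need a y.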