I was wondering how you would extend an existing algorithm, let's say es to allow for a recurrent neural network instead of a feedforward one. You would need to change make_policy_network to replace the MLP with a newly created RNN so my impression is that you'd need to change only networks.py and not train.py. But not sure how to implement it
I was wondering how you would extend an existing algorithm, let's say es to allow for a recurrent neural network instead of a feedforward one. You would need to change
make_policy_network
to replace the MLP with a newly created RNN so my impression is that you'd need to change onlynetworks.py
and nottrain.py
. But not sure how to implement it