ikostrikov / pytorch-trpo

PyTorch implementation of Trust Region Policy Optimization
MIT License
433 stars, 91 forks

Object oriented #19

Open GittiHab opened 4 years ago

GittiHab commented 4 years ago

It would be nice if the agent were an object (with methods such as "get_action" and "remember", or similar) so that it could be reused more easily.
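
To make the request concrete, here is a minimal sketch of what such an interface might look like. Everything in it is an assumption for illustration: the names `Agent`, `Transition`, `policy_fn`, and `clear` are hypothetical and are not part of pytorch-trpo, and the policy is a plain callable so the example runs without PyTorch. In practice `policy_fn` would presumably wrap the policy network's forward pass and action sampling.

```python
from dataclasses import dataclass, field
from typing import Callable, List, NamedTuple, Sequence


class Transition(NamedTuple):
    """One environment step stored by the agent (hypothetical layout)."""
    state: Sequence[float]
    action: float
    reward: float
    done: bool


@dataclass
class Agent:
    """Hypothetical agent wrapper; not the repository's actual API."""
    # Any callable mapping a state to an action (assumed interface).
    policy_fn: Callable[[Sequence[float]], float]
    # Transitions collected since the last call to clear().
    memory: List[Transition] = field(default_factory=list)

    def get_action(self, state: Sequence[float]) -> float:
        """Ask the wrapped policy for an action in the given state."""
        return self.policy_fn(state)

    def remember(self, state: Sequence[float], action: float,
                 reward: float, done: bool) -> None:
        """Store one transition for a later policy update."""
        self.memory.append(Transition(state, action, reward, done))

    def clear(self) -> None:
        """Drop stored transitions, e.g. after an update step."""
        self.memory.clear()


# Usage with a placeholder deterministic policy (sum of the state features).
agent = Agent(policy_fn=lambda state: sum(state))
state = [0.5, -0.25]
action = agent.get_action(state)
agent.remember(state, action, reward=1.0, done=False)
```

With an interface along these lines, a training loop would only need to call `get_action` and `remember` per step, and the update logic could stay inside the agent.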