daisatojp / mpo

PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
GNU General Public License v3.0
70 stars 19 forks source link

Update for pytorch and gym v0.26.0 breaking change. #15

Closed zhenpingfeng closed 1 year ago