daisatojp / mpo

PyTorch implementation of Maximum a Posteriori Policy Optimisation (MPO)
GNU General Public License v3.0

learn Discrete Action Space #2

Closed (daisatojp closed this issue 4 years ago)

daisatojp commented 4 years ago

Enable the model to learn in discrete action spaces.