issues
search
daisatojp
/
mpo
PyTorch Implementation of the Maximum a Posteriori Policy Optimisation
GNU General Public License v3.0
72
stars
19
forks
source link
learn Discrete Action Space
#2
Closed
daisatojp
closed
4 years ago
daisatojp
commented
4 years ago
enable model to learn Discrete Action Space
enable model to learn Discrete Action Space