huangeddie / MuZeroGoJax

Mu Zero Go implemented with JAX and GoJAX
MIT License
9 stars 0 forks source link

Make max max action capacity dynamically equal to max actions sampled #250

Closed huangeddie closed 1 year ago