huangeddie / MuZeroGoJax

Mu Zero Go implemented with JAX and GoJAX
MIT License
9 stars 0 forks

Pass sampled actions and partial transition values from self play to train step #201

Closed by huangeddie 1 year ago

huangeddie commented 1 year ago