google-deepmind / mctx

Monte Carlo tree search in JAX
Apache License 2.0
2.33k stars 189 forks source link

Recommend to use the Q-value of the selected action to estimate the value of the root state. #12

Closed copybara-service[bot] closed 2 years ago

copybara-service[bot] commented 2 years ago

Recommend to use the Q-value of the selected action to estimate the value of the root state.