few questions about setting hyperparameters

google-deepmind / mctx

Monte Carlo tree search in JAX

Apache License 2.0

For Go, the default value of `max_num_considered_actions` is 16. Should this value increase as `num_simulations` increases? What about `maxvisit_init`? What value of `max_num_considered_actions` works better when `num_simulations` is 400, 800, or 1600?

There are a few other hyperparameters that I'm not sure how to set optimally, and I hope you can help me check my settings. Do I need to adjust `qtransform`?

For 19×19 Go, my settings are:

    from functools import partial

    import mctx

    policy_output = mctx.gumbel_muzero_policy(
        ...
        num_simulations=400,
        max_num_considered_actions=32,
        qtransform=partial(
            mctx.qtransform_completed_by_mix_value,
            value_scale=0.1,
            maxvisit_init=50,
            rescale_values=False,
        ),
        gumbel_scale=1.0,
    )
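To make the interaction between `num_simulations` and `max_num_considered_actions` more concrete, here is a rough sketch of how a Sequential Halving budget might be split across root actions. It is a simplified standalone approximation, not the mctx implementation: the function name `sequential_halving_schedule` is hypothetical, and the per-phase formula (an equal share of the budget per halving phase) is an assumption that ignores leftover simulations.

```python
import math


def sequential_halving_schedule(num_simulations, num_considered):
    """Approximate per-phase budget of Sequential Halving at the root.

    Returns a list of (actions_considered, visits_per_action) pairs, one per
    halving phase. Hypothetical helper for illustration only; it is not part
    of mctx and does not redistribute leftover simulations.
    """
    # Number of halving phases needed to go from num_considered down to 2.
    num_phases = max(1, math.ceil(math.log2(num_considered)))
    schedule = []
    for _ in range(num_phases):
        # Each phase gets roughly an equal share of the total budget,
        # split evenly over the actions still under consideration.
        visits_per_action = max(1, num_simulations // (num_phases * num_considered))
        schedule.append((num_considered, visits_per_action))
        # Keep the better half of the actions, never fewer than two.
        num_considered = max(2, num_considered // 2)
    return schedule


if __name__ == "__main__":
    for sims in (400, 800, 1600):
        for considered in (16, 32):
            print(sims, considered, sequential_halving_schedule(sims, considered))
```

Under these assumptions, 400 simulations with 32 considered actions leaves only about 2 visits per action in the first phase, versus about 6 with 16 considered actions. That is the trade-off I am unsure about when scaling `num_simulations` to 800 or 1600.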

few questions about setting hyperparameters #62