Do we need to switch to simulation mode within a main action?
For example,
A main action is a choice among (PLAY-CARD, HERO-POWER, END-TURN).
Assume we are in selection mode at this main action node, so the UCB policy is used to pick one of these choices.
Assume we choose the PLAY-CARD action.
Assume this is the FIRST TIME we make this choice, so a new node is added to the game tree.
Now, do we want to switch to simulation mode?
In the current design, we only switch to simulation mode after this MAIN ACTION and all of its SUB ACTIONS are done.
That is, we switch to simulation after we have
added a node for PLAY-CARD
added a node for CHOOSE-HAND-CARD
added a node for CHOOSE-TARGET (if any)
more nodes for callback (if any)
Only then, once this main action is done, do we switch to simulation mode.
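
The current design described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the project's actual code: Node, ucb_pick, run_main_action, and the example choices (card0, card1, enemy-hero) are all hypothetical names. It shows that new nodes keep being added for every sub action, and that the mode only changes at the main action boundary.

```python
import math


class Node:
    """Hypothetical tree node: one child per choice made at this node."""

    def __init__(self):
        self.children = {}       # choice -> Node
        self.visits = 0
        self.total_reward = 0.0


def ucb_pick(node, choices, c=1.4):
    """Pick a choice with UCB1; choices never tried before are taken first."""
    for choice in choices:
        if choice not in node.children:
            return choice
    log_n = math.log(max(node.visits, 1))

    def score(choice):
        child = node.children[choice]
        if child.visits == 0:
            return math.inf
        mean = child.total_reward / child.visits
        return mean + c * math.sqrt(log_n / child.visits)

    return max(choices, key=score)


def run_main_action(root, steps):
    """Walk one main action and all of its sub actions.

    steps is a list of (label, choices) pairs, main action first.
    Returns (mode, picked, created), where created lists the labels
    of the steps for which a new node had to be added.
    """
    node = root
    picked, created = [], []
    for label, choices in steps:
        choice = ucb_pick(node, choices)
        if choice not in node.children:
            # First time this choice is made: add a node, but do NOT
            # switch to simulation yet (we are still inside the main action).
            node.children[choice] = Node()
            created.append(label)
        node = node.children[choice]
        picked.append(choice)
    # The switch happens only here, at the main action boundary.
    mode = "simulation" if created else "selection"
    return mode, picked, created


root = Node()
steps = [
    ("MAIN-ACTION", ["PLAY-CARD", "HERO-POWER", "END-TURN"]),
    ("CHOOSE-HAND-CARD", ["card0", "card1"]),
    ("CHOOSE-TARGET", ["enemy-hero"]),
]
mode, picked, created = run_main_action(root, steps)
```

The alternative raised by the question would be to leave the loop as soon as the first new node is created, so the remaining sub actions would be resolved by the simulation policy instead of adding further nodes to the tree.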