joshmiller17 opened 5 years ago
Instead of MCTS returning one action at a time, it could predict the whole next sequence of actions up until a random element is introduced.
States/actions could mark themselves as random or deterministic.
It would be interesting to see the top ~3 unique sequence choices, ordered by predicted value.
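A minimal sketch of the idea, assuming a hypothetical search tree whose nodes carry an `is_chance` flag plus visit/value statistics (none of these names come from an existing API): after the MCTS search finishes, greedily walk the most-visited children and stop at the first chance node, so the planner returns a whole action sequence instead of a single move.

```python
# Hypothetical sketch only: Node fields (is_chance, visits, total_value,
# children) are assumptions, not part of any existing MCTS implementation.
from dataclasses import dataclass, field
from typing import List, Optional, Tuple


@dataclass
class Node:
    action: Optional[str] = None
    is_chance: bool = False          # True if reaching this state resolves randomness
    visits: int = 0
    total_value: float = 0.0
    children: List["Node"] = field(default_factory=list)

    @property
    def mean_value(self) -> float:
        return self.total_value / self.visits if self.visits else 0.0


def best_action_sequence(root: Node) -> List[str]:
    """Greedily follow the most-visited child until a chance node (or leaf)."""
    sequence: List[str] = []
    node = root
    while node.children and not node.is_chance:
        node = max(node.children, key=lambda c: c.visits)
        sequence.append(node.action)
    return sequence


def top_sequences(root: Node, k: int = 3) -> List[Tuple[List[str], float]]:
    """Return up to k sequences (distinct in their first action), ranked by mean value."""
    ranked = sorted(root.children, key=lambda c: c.mean_value, reverse=True)
    return [([c.action] + best_action_sequence(c), c.mean_value) for c in ranked[:k]]
```

`top_sequences` is one cheap way to surface the "top ~3 unique sequence choices": it only forces uniqueness on the first action, which keeps the sequences from being trivial prefixes of each other.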
Idea: create a reward function that approximates the value of a state from designer heuristics, used only to bias the MCTS rollout policy probabilities toward more efficient searches.
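One way this could look, as a sketch under assumptions: a designer-supplied `heuristic_value` function (a placeholder here, not an existing API) scores candidate successor states, and the rollout policy samples actions with softmax weights over those scores instead of uniformly.

```python
# Hypothetical sketch: heuristic_value() stands in for whatever state-value
# approximation the designer provides; it is not a real library function.
import math
import random


def heuristic_value(state) -> float:
    # Placeholder heuristic: here the "state" is just a number.
    return float(state)


def rollout_policy(actions, successor, temperature=1.0, rng=random):
    """Sample one action, weighted by a softmax over heuristic successor values.

    `successor(a)` returns the state reached by taking action `a`.
    A lower temperature concentrates rollouts on heuristically better moves;
    a higher one approaches the usual uniform-random rollout.
    """
    scores = [heuristic_value(successor(a)) / temperature for a in actions]
    m = max(scores)  # subtract the max for numerical stability
    weights = [math.exp(s - m) for s in scores]
    return rng.choices(actions, weights=weights, k=1)[0]
```

Keeping the heuristic confined to the rollout policy (rather than the backed-up rewards) means the search stays unbiased in the limit while still spending its simulations more efficiently.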