joshmiller17 / spirecomm

Slay the Spire AI using an evolving behaviour tree and MCTS forward simulations.
MIT License
2 stars 0 forks source link

MCTS to predict until random #26

Open joshmiller17 opened 5 years ago

joshmiller17 commented 5 years ago

Instead of MCTS returning one action at a time, it could predict the next sequence of actions up until a random element is introduced, such as:

  1. Monsters change intents
  2. Card has random effect
  3. Draw

States/Actions can mark themselves as random or not.

Would be interesting to see its top ~3 unique sequence choices in order of predicted value.

joshmiller17 commented 5 years ago

Idea: create a reward function which approximates the value of a state based on designer heuristics, used only to guide MCTS rollout policy probabilities for more efficient searches