impl Policy such that it encapsulates random choice

krukah / robopoker

Play, learn, solve, and analyze No-Limit Texas Hold'em. The implementation follows Monte Carlo counterfactual regret minimization over hierarchical K-means imperfect-recall abstractions.

MIT License

43 stars, 5 forks

impl Policy such that it encapsulates random choice #11

Closed by krukah 1 month ago

krukah commented 8 months ago

Policy { choices: Vec<(Action, Weight)> }
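
A minimal sketch of what such a Policy could look like, assuming the pairs are tuples and each weight is a non-negative, unnormalized probability. The Action variants, the Weight alias, and the choose method are hypothetical stand-ins and not taken from the robopoker source. To keep the sketch free of external crates, the caller supplies a uniform draw u in [0, 1), and the Policy maps it to an action in proportion to the weights.

```rust
// Hypothetical stand-ins: the issue does not define Action or Weight.
#[derive(Debug, Clone, Copy, PartialEq)]
pub enum Action {
    Fold,
    Call,
    Raise(u32),
}

pub type Weight = f32;

pub struct Policy {
    choices: Vec<(Action, Weight)>,
}

impl Policy {
    pub fn new(choices: Vec<(Action, Weight)>) -> Self {
        Self { choices }
    }

    /// Maps a uniform draw `u` in [0, 1) to an action, with probability
    /// proportional to its weight. Assumes weights are non-negative.
    /// Returns None when there are no choices or the total weight is not positive.
    pub fn choose(&self, u: f32) -> Option<Action> {
        let total: Weight = self.choices.iter().map(|(_, w)| *w).sum();
        if !(total > 0.0) {
            return None;
        }
        // Walk the cumulative weights until the scaled draw falls inside a bucket.
        let mut threshold = u * total;
        for (action, weight) in &self.choices {
            if threshold < *weight {
                return Some(*action);
            }
            threshold -= *weight;
        }
        // Floating-point rounding can leave a tiny remainder; fall back to
        // the last action that carries positive weight.
        self.choices
            .iter()
            .rev()
            .find(|(_, w)| *w > 0.0)
            .map(|(action, _)| *action)
    }
}
```

With this shape, callers never touch the weights directly: they hand the Policy a random number (for example, one drawn from the rand crate) and get an Action back. Whether the real implementation takes a draw, an RNG handle, or owns its own randomness is a design choice this sketch leaves open.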