Jamesflynn1 / CS344-Opponent-Exploitation-Poker

A third year uni project aiming to implement and evaluate the EFR algorithm with different deviation types and explore a potential tradeoff between exploitability and expected value of a strategy in practice.
0 stars 0 forks source link

Learning algorithms for game theory #35

Closed Jamesflynn1 closed 10 months ago

Jamesflynn1 commented 1 year ago

Regret definition

Rationality and the problems with learning against an opponents strategy (exploitation)

What is no regret (no internal, no phi regret ect) learning

Regret matching

Fictitious self play

CFR and primary theorems