A third year uni project aiming to implement and evaluate the EFR algorithm with different deviation types and explore a potential tradeoff between exploitability and expected value of a strategy in practice.
(EXCL. project management as this will be covered before the evaluation/conclusion)
Experiment details
-Leduc Poker setting, fictitious self play,
-Iterations, which deviation subsets tested.
-Explain metrics used and provide a formal definition.
(EXCL. project management as this will be covered before the evaluation/conclusion)
Experiment details -Leduc Poker setting, fictitious self play, -Iterations, which deviation subsets tested. -Explain metrics used and provide a formal definition.