Texas Hold'em OpenAI Gym poker environment with reinforcement learning based on keras-rl. Includes virtual rendering and Monte Carlo simulation for equity calculation.
MIT License
612 stars, 168 forks
After 700k iterations the agent just learns to go all in #78
Hello,
After 700k iterations, the agent just learns to go all in every time. Maybe the reward architecture should be different?
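To put a number on "all in every time", here is a minimal sketch of how the action distribution could be checked. It assumes the agent's chosen actions are logged as a plain list of labels; the `"all_in"` and `"fold"` labels and the `action_frequencies` helper are hypothetical and not taken from the repository.

```python
from collections import Counter


def action_frequencies(actions):
    """Return the share of each action label in a list of logged actions."""
    counts = Counter(actions)
    total = sum(counts.values())
    if total == 0:
        # No actions logged, so there is nothing to report.
        return {}
    return {action: n / total for action, n in counts.items()}


# Hypothetical log of the agent's chosen actions over a few evaluation hands.
logged = ["all_in", "all_in", "fold", "all_in"]
print(action_frequencies(logged))
```

A share close to 1.0 for the all-in label across many evaluation hands would suggest the policy has collapsed to a single action.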
What is your experience?
Best regards, Roberts