JoeyAndres / rl

Reinforcement Learning library
2 stars 0 forks source link

Softmax suboptimal action choice. #39

Open JoeyAndres opened 7 years ago