icoxfog417 / baby-steps-of-rl-ja

Sample code for the book "Pythonで学ぶ強化学習 -入門から実践まで-" (Learning Reinforcement Learning with Python: From Introduction to Practice)
Apache License 2.0
431 stars 262 forks

Fix issue #6: Calculation of Expected Reward of Policy Iteration #7

Closed: icoxfog417 closed this 5 years ago

icoxfog417 commented 5 years ago

Fix issue #6: Calculation of Expected Reward of Policy Iteration