issues
search
rhalbersma
/
doctrina
Exercises in reinforcement learning
Boost Software License 1.0
3
stars
0
forks
source link
Implement all the classic tabular methods
#1
Open
rhalbersma
opened
4 years ago
rhalbersma
commented
4 years ago
[x] policy iteration
[x] value iteration
[ ] MC importance sampling
[ ] MC weighted importance sampling
[x] SARSA
[x] Q-learning
[x] expected SARSA
[ ] double Q-learning