issues
search
LyWangPX
/
Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions
Solutions of Reinforcement Learning, An Introduction
MIT License
2.04k
stars
465
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
correction for Ex4.2.py
#99
Yelinz
opened
6 months ago
0
Exercise 7.1: Value estimates don't change from step to step
#98
Nafyaz
closed
9 months ago
1
Exercise 4.6: Changes in Policy Iteration algorithm for epsilon-soft policies
#97
MoeenTB
opened
1 year ago
0
Exercise 3.4 - 0 probability rows
#96
gblawrence03
opened
1 year ago
0
Exercise 4.8: Avoid numerical instability in policy
#95
mhoehle
opened
1 year ago
0
Exercise 3.18
#94
ghost
opened
2 years ago
0
Exercise 10.5
#93
ShawnHymel
opened
2 years ago
0
Exercise 12.13
#92
thisisWooyeol
closed
2 years ago
0
Improved script for ex. 4.7
#91
CorentinJ
opened
2 years ago
0
a very small error
#90
PandaZhym
opened
2 years ago
1
Update 2_4.py
#89
kiril-chilingarashvili
opened
3 years ago
0
Chapter 13 - Policy gradient
#88
castorfou
opened
3 years ago
0
Exercise 2.3
#87
ShaowuChen
closed
3 years ago
1
[Ex 4.5] Deterministic policy
#86
Jonathan2021
opened
3 years ago
0
[Ex 4.2] Changing dynamics changes the state values
#85
Jonathan2021
opened
3 years ago
1
How to contribute?
#84
ovasseur
opened
3 years ago
1
Exercise 3.29 might have a mistake
#83
rvitorper
opened
3 years ago
0
Ex4.7-A possible bug
#82
khbalhandawi
opened
3 years ago
2
Exercise4.9
#81
soonjune
opened
3 years ago
3
Excercise 3.5
#80
manu2000
opened
3 years ago
0
Execise 6.1
#79
RangerChu
opened
3 years ago
2
Ex 4.1
#78
StoyanVenDimitrov
opened
3 years ago
1
Add Ex 7.2
#77
JChunX
opened
3 years ago
0
Where do the excercise come from because I cannot find them in the original book?
#76
zhiqianglucky
opened
3 years ago
2
question about exercise 5.13
#75
315930399
opened
4 years ago
0
Exercise 4.6
#74
jerryfrancis-97
opened
4 years ago
0
Exercise 3.28
#73
NakuraMino
closed
4 years ago
3
Exercise 6.1
#72
Kaniee
closed
4 years ago
1
Exercise 4.5
#71
Kaniee
closed
4 years ago
1
Idea: Add source files of PDFs to repo
#70
Kaniee
closed
4 years ago
1
Exercise 4.8
#69
LevinCeglie
closed
4 years ago
1
Solution to chapter 2
#68
ZhangNanXi
closed
4 years ago
1
Ex 4.7-A
#67
Avalpreet
closed
4 years ago
2
Clarification Ex 4.5_ Deterministic Policy
#66
Avalpreet
closed
4 years ago
2
Ex 3.5
#65
franzoni315
closed
4 years ago
2
Ch 10 Ex. 10.6
#64
KimMatt
closed
4 years ago
2
typo in 3.9
#63
harristeague
closed
4 years ago
1
added ex4.2 for chapter 4
#62
stchau4work
closed
4 years ago
0
106107 corrections
#61
dipplestix
closed
4 years ago
0
ex 10.6 and 10.7 don't match book
#60
dipplestix
closed
4 years ago
2
Exercise 5.3
#59
NishanthARao
closed
4 years ago
1
Add ex8.8 code and plot
#58
burmecia
closed
4 years ago
0
Add ex8.4 code and plot
#57
burmecia
closed
4 years ago
1
ex4.5 solution modification
#56
xinyuan-huang
closed
4 years ago
2
add averaged result and plot
#55
burmecia
closed
4 years ago
0
Error in solution for 12.2
#54
gakshaygupta
closed
4 years ago
4
error in exercise 11.3
#53
gakshaygupta
closed
4 years ago
8
Ex 6.13
#52
burmecia
closed
4 years ago
2
Ex 6.11
#51
burmecia
closed
4 years ago
1
add exercise 6.9 and 6.10 code
#50
burmecia
closed
4 years ago
3
Next