LyWangPX Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions issues

LyWangPX / Reinforcement-Learning-2nd-Edition-by-Sutton-Exercise-Solutions

Solutions of Reinforcement Learning, An Introduction

MIT License

2.04k stars 465 forks

issues

correction for Ex4.2.py

#99 Yelinz opened 6 months ago
0
Exercise 7.1: Value estimates don't change from step to step

#98 Nafyaz closed 9 months ago
1
Exercise 4.6: Changes in Policy Iteration algorithm for epsilon-soft policies

#97 MoeenTB opened 1 year ago
0
Exercise 3.4 - 0 probability rows

#96 gblawrence03 opened 1 year ago
0
Exercise 4.8: Avoid numerical instability in policy

#95 mhoehle opened 1 year ago
0
Exercise 3.18

#94 ghost opened 2 years ago
0
Exercise 10.5

#93 ShawnHymel opened 2 years ago
0
Exercise 12.13

#92 thisisWooyeol closed 2 years ago
0
Improved script for ex. 4.7

#91 CorentinJ opened 2 years ago
0
a very small error

#90 PandaZhym opened 2 years ago
1
Update 2_4.py

#89 kiril-chilingarashvili opened 3 years ago
0
Chapter 13 - Policy gradient

#88 castorfou opened 3 years ago
0
Exercise 2.3

#87 ShaowuChen closed 3 years ago
1
[Ex 4.5] Deterministic policy

#86 Jonathan2021 opened 3 years ago
0
[Ex 4.2] Changing dynamics changes the state values

#85 Jonathan2021 opened 3 years ago
1
How to contribute?

#84 ovasseur opened 3 years ago
1
Exercise 3.29 might have a mistake

#83 rvitorper opened 3 years ago
0
Ex4.7-A possible bug

#82 khbalhandawi opened 3 years ago
2
Exercise 4.9

#81 soonjune opened 3 years ago
3
Exercise 3.5

#80 manu2000 opened 3 years ago
0
Exercise 6.1

#79 RangerChu opened 3 years ago
2
Ex 4.1

#78 StoyanVenDimitrov opened 3 years ago
1
Add Ex 7.2

#77 JChunX opened 3 years ago
0
Where do the exercises come from? I cannot find them in the original book.

#76 zhiqianglucky opened 3 years ago
2
question about exercise 5.13

#75 315930399 opened 4 years ago
0
Exercise 4.6

#74 jerryfrancis-97 opened 4 years ago
0
Exercise 3.28

#73 NakuraMino closed 4 years ago
3
Exercise 6.1

#72 Kaniee closed 4 years ago
1
Exercise 4.5

#71 Kaniee closed 4 years ago
1
Idea: Add source files of PDFs to repo

#70 Kaniee closed 4 years ago
1
Exercise 4.8

#69 LevinCeglie closed 4 years ago
1
Solution to chapter 2

#68 ZhangNanXi closed 4 years ago
1
Ex 4.7-A

#67 Avalpreet closed 4 years ago
2
Clarification Ex 4.5_ Deterministic Policy

#66 Avalpreet closed 4 years ago
2
Ex 3.5

#65 franzoni315 closed 4 years ago
2
Ch 10 Ex. 10.6

#64 KimMatt closed 4 years ago
2
typo in 3.9

#63 harristeague closed 4 years ago
1
added ex4.2 for chapter 4

#62 stchau4work closed 4 years ago
0
106107 corrections

#61 dipplestix closed 4 years ago
0
ex 10.6 and 10.7 don't match book

#60 dipplestix closed 4 years ago
2
Exercise 5.3

#59 NishanthARao closed 4 years ago
1
Add ex8.8 code and plot

#58 burmecia closed 4 years ago
0
Add ex8.4 code and plot

#57 burmecia closed 4 years ago
1
ex4.5 solution modification

#56 xinyuan-huang closed 4 years ago
2
add averaged result and plot

#55 burmecia closed 4 years ago
0
Error in solution for 12.2

#54 gakshaygupta closed 4 years ago
4
error in exercise 11.3

#53 gakshaygupta closed 4 years ago
8
Ex 6.13

#52 burmecia closed 4 years ago
2
Ex 6.11

#51 burmecia closed 4 years ago
1
add exercise 6.9 and 6.10 code

#50 burmecia closed 4 years ago
3