issues
search
vojtamolda
/
reinforcement-learning-an-introduction
Solutions to exercises in Reinforcement Learning: An Introduction (2nd Edition).
325
stars
71
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Excercise 7.7
#25
IAMHAADICOOL
opened
4 months ago
0
Exercise 3.21
#24
rjscherrer
opened
1 year ago
0
[Exercise 04.07] possible issue while calculating the transfer_cost if not all the cars indicated by the action can be transferred
#23
duxtinto
opened
1 year ago
1
Exercise 3.16
#22
rjscherrer
opened
1 year ago
1
Exercise 3.9
#21
rjscherrer
closed
1 year ago
1
Exercise 2.2: A5 is also a random action
#20
duxtinto
closed
1 year ago
2
[Exercise 2.3] probability of picking the optimal action for epsilon = 0
#19
minimental
closed
1 year ago
2
[Exercise7.2] NameError: name 'environment' is not defined
#18
Xuanaxx
closed
1 year ago
1
Error in "Exercise 8.4*.ipynb" --> "TypeError: dispatcher for __array_function__ did not return an iterable"
#17
MariosGkMeng
opened
1 year ago
2
error when running sarsa code
#16
sinamcr7
closed
1 year ago
2
Ex2.1
#15
torayeff
opened
2 years ago
0
[Exercise 1.1] Self-Play
#14
lijiyao919
closed
2 years ago
2
[Exercise 2.2] Question about solution (possible error)
#13
ShawnHymel
closed
2 years ago
4
[Exercise 3.9] Typo and a Better Solution
#12
Bbbstin
closed
2 years ago
0
[Exercice 3.26] The expression of q* should not have max over the action a
#11
anonymous-pusher
opened
2 years ago
0
Error to Exercise 2.4
#10
earthwuyang
closed
2 years ago
2
Error in solution to Exercise 2.2
#9
earthwuyang
closed
2 years ago
3
Exercise 4.2 & 4.6
#8
hhoppe
closed
2 years ago
7
Exercise 3.22
#7
hhoppe
closed
2 years ago
3
Truncated typed answers in all exercises in chapters 3, 4, 6 and 9
#6
Jonathan2021
closed
2 years ago
2
[Exercise 5.14] Errors in updates, time indices and more
#5
Jonathan2021
opened
3 years ago
2
Exercise 5.6: No averaging and uncoherent times.
#4
Jonathan2021
closed
2 years ago
4
Question. Chapter 5, exercise 5.12: reversed return sequence
#3
boldyshev
closed
3 years ago
2
Solution to Exercise 8.1 is incomplete in the pdf
#2
umeshksingla
closed
3 years ago
2
Exercise 5.6: No discount appearing in equation
#1
Jonathan2021
closed
3 years ago
3