issues
search
ShangtongZhang
/
reinforcement-learning-an-introduction
Python Implementation of Reinforcement Learning: An Introduction
MIT License
13.37k
stars
4.8k
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Add CITATION.cff
#163
RensOliemans
opened
1 year ago
4
Citing this repository
#162
RensOliemans
opened
1 year ago
0
Chapter 2: Couldn't find the file '../images/figure_2_1.png'
#161
Zhangxiaoyi688
opened
1 year ago
0
(fix): ten_armed_testbed.py np.float
#160
iw4p
closed
1 year ago
0
Fixed START, GOAL state
#159
MichaelQiYinChen
opened
1 year ago
0
chapter4 gamblers_problem, showing multiple best actions
#158
itschenxi
opened
1 year ago
0
ch06 random_walk td method
#157
Perseus1993
opened
1 year ago
1
l
#156
Karp8841
opened
1 year ago
0
Unclear point for the code in Blackjack example
#155
eatam
opened
2 years ago
1
Wrong Bellman equation for Jack's car rental problem?
#154
Raymondliz
closed
2 years ago
1
The plicy of chapter1
#153
benroo123
opened
2 years ago
1
Problem of excercise 2.5
#152
qiqiJiang-st
opened
2 years ago
0
example to use it on human genetic data?
#151
Shicheng-Guo
opened
2 years ago
0
problem about chapter04/car_rental.py
#150
shaoeChen
opened
2 years ago
1
ten_armed_testbed.py中的figure2_3为何不用“sample_averages”
#149
A-Pai
opened
3 years ago
0
Minor changes
#148
VEXLife
closed
2 years ago
1
wrong figure number for chapter 11
#147
arashHaratian
opened
3 years ago
0
typo
#146
arashHaratian
closed
2 years ago
0
tictactoe compete() plays 1000 almost identical games
#145
gsverhoeven
opened
3 years ago
1
add script that reproduces example 12.14
#144
Johann-Huber
closed
3 years ago
1
Figure 5.3 change
#143
VEXLife
closed
3 years ago
2
Change the axis limit and offset.
#142
VEXLife
closed
3 years ago
1
Generalization to abstract classes for Environment/Agents?
#141
chicotobi
closed
3 years ago
2
Patch 1
#140
VEXLife
closed
3 years ago
2
something wrong in matplotlib
#139
FYYFU
opened
3 years ago
2
Update trajectory_sampling.py
#138
vinnik-dmitry07
closed
3 years ago
0
docs: fix simple typo, resoultion -> resolution
#137
timgates42
closed
3 years ago
1
nit: chapter 6 references
#136
mahiuchun
opened
3 years ago
0
A simpler draw function
#135
rohitdavas
opened
3 years ago
2
Unable to get the same results while formulating differently
#134
rohitdavas
closed
3 years ago
1
No related package on the zip file
#133
leiyongxiang1205
closed
3 years ago
1
add state labels on the tables
#132
yasutak
closed
4 years ago
1
reinforcement-learning
#131
yang-chenyu104
closed
4 years ago
0
Add code to draw optimal policy
#130
rogertrullo
closed
4 years ago
1
Add linear system to gridworld
#129
rogertrullo
closed
4 years ago
1
Help on ten_armed_testbed.py
#128
ai4pharma
closed
4 years ago
3
Chapter4, gambler problem
#127
07hyx06
closed
4 years ago
1
Chapter 11
#126
mattgithub1919
closed
4 years ago
12
chap1/tic_tac_toc.py why does make td_error zero when exploring
#125
GarfieldF
closed
4 years ago
1
chapter04/car_rental_synchronous.py: the table needs to be flipped.
#124
QuangTran4810
closed
4 years ago
1
chapter06/random_wark.py
#123
ChenHuaYou
closed
4 years ago
1
a little confuse about chapter5/blackjack.py
#122
ChenHuaYou
closed
4 years ago
2
chapter04/gamblers_problem.py line33 to 62 may has a problem
#121
ChenHuaYou
closed
4 years ago
2
Reinforcement learning
#120
palbha
closed
4 years ago
1
Update figures 13_1 and 13_2
#119
scrpy
closed
4 years ago
1
discount factor for Chapter 10
#118
roachsinai
closed
4 years ago
1
Misunderstanding in chapter 2
#117
zZthebreakerZz
closed
4 years ago
1
Tile Coding scaling issue
#116
MJeremy2017
closed
4 years ago
2
Fix usable_ace_player bug, fix indention error, set POLICY_PLAYER dty…
#115
goal
closed
5 years ago
1
How to formulate problem with State is a combination of multiple factors?
#114
MJeremy2017
closed
5 years ago
1
Next