ShangtongZhang reinforcement-learning-an-introduction issues

ShangtongZhang / reinforcement-learning-an-introduction

Python Implementation of Reinforcement Learning: An Introduction

MIT License

13.37k stars 4.8k forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

Add CITATION.cff

#163 RensOliemans opened 1 year ago
4
Citing this repository

#162 RensOliemans opened 1 year ago
0
Chapter 2: Couldn't find the file '../images/figure_2_1.png'

#161 Zhangxiaoyi688 opened 1 year ago
0
(fix): ten_armed_testbed.py np.float

#160 iw4p closed 1 year ago
0
Fixed START, GOAL state

#159 MichaelQiYinChen opened 1 year ago
0
chapter4 gamblers_problem, showing multiple best actions

#158 itschenxi opened 1 year ago
0
ch06 random_walk td method

#157 Perseus1993 opened 1 year ago
1
l

#156 Karp8841 opened 1 year ago
0
Unclear point for the code in Blackjack example

#155 eatam opened 2 years ago
1
Wrong Bellman equation for Jack's car rental problem?

#154 Raymondliz closed 2 years ago
1
The plicy of chapter1

#153 benroo123 opened 2 years ago
1
Problem of excercise 2.5

#152 qiqiJiang-st opened 2 years ago
0
example to use it on human genetic data?

#151 Shicheng-Guo opened 2 years ago
0
problem about chapter04/car_rental.py

#150 shaoeChen opened 2 years ago
1
ten_armed_testbed.py中的figure2_3为何不用“sample_averages”

#149 A-Pai opened 3 years ago
0
Minor changes

#148 VEXLife closed 2 years ago
1
wrong figure number for chapter 11

#147 arashHaratian opened 3 years ago
0
typo

#146 arashHaratian closed 2 years ago
0
tictactoe compete() plays 1000 almost identical games

#145 gsverhoeven opened 3 years ago
1
add script that reproduces example 12.14

#144 Johann-Huber closed 3 years ago
1
Figure 5.3 change

#143 VEXLife closed 3 years ago
2
Change the axis limit and offset.

#142 VEXLife closed 3 years ago
1
Generalization to abstract classes for Environment/Agents?

#141 chicotobi closed 3 years ago
2
Patch 1

#140 VEXLife closed 3 years ago
2
something wrong in matplotlib

#139 FYYFU opened 3 years ago
2
Update trajectory_sampling.py

#138 vinnik-dmitry07 closed 3 years ago
0
docs: fix simple typo, resoultion -> resolution

#137 timgates42 closed 3 years ago
1
nit: chapter 6 references

#136 mahiuchun opened 3 years ago
0
A simpler draw function

#135 rohitdavas opened 3 years ago
2
Unable to get the same results while formulating differently

#134 rohitdavas closed 3 years ago
1
No related package on the zip file

#133 leiyongxiang1205 closed 3 years ago
1
add state labels on the tables

#132 yasutak closed 4 years ago
1
reinforcement-learning

#131 yang-chenyu104 closed 4 years ago
0
Add code to draw optimal policy

#130 rogertrullo closed 4 years ago
1
Add linear system to gridworld

#129 rogertrullo closed 4 years ago
1
Help on ten_armed_testbed.py

#128 ai4pharma closed 4 years ago
3
Chapter4, gambler problem

#127 07hyx06 closed 4 years ago
1
Chapter 11

#126 mattgithub1919 closed 4 years ago
12
chap1/tic_tac_toc.py why does make td_error zero when exploring

#125 GarfieldF closed 4 years ago
1
chapter04/car_rental_synchronous.py: the table needs to be flipped.

#124 QuangTran4810 closed 4 years ago
1
chapter06/random_wark.py

#123 ChenHuaYou closed 4 years ago
1
a little confuse about chapter5/blackjack.py

#122 ChenHuaYou closed 4 years ago
2
chapter04/gamblers_problem.py line33 to 62 may has a problem

#121 ChenHuaYou closed 4 years ago
2
Reinforcement learning

#120 palbha closed 4 years ago
1
Update figures 13_1 and 13_2

#119 scrpy closed 4 years ago
1
discount factor for Chapter 10

#118 roachsinai closed 4 years ago
1
Misunderstanding in chapter 2

#117 zZthebreakerZz closed 4 years ago
1
Tile Coding scaling issue

#116 MJeremy2017 closed 4 years ago
2
Fix usable_ace_player bug, fix indention error, set POLICY_PLAYER dty…

#115 goal closed 5 years ago
1
How to formulate problem with State is a combination of multiple factors?

#114 MJeremy2017 closed 5 years ago
1