MatthewJA / Inverse-Reinforcement-Learning

Implementations of selected inverse reinforcement learning algorithms.
MIT License
957 stars, 238 forks

sum of Gridworld.transition_probability is not 1 #1

Closed by yosh7of9 8 years ago

yosh7of9 commented 8 years ago

gw = gridworld.Gridworld(5, .3, .2)
gw.transition_probability[7, 0, :].reshape(5, 5)

outputs

array([[ 0.   ,  0.   ,  0.075,  0.   ,  0.   ],
       [ 0.   ,  0.075,  0.075,  0.775,  0.   ],
       [ 0.   ,  0.   ,  0.075,  0.   ,  0.   ],
       [ 0.   ,  0.   ,  0.   ,  0.   ,  0.   ],
       [ 0.   ,  0.   ,  0.   ,  0.   ,  0.   ]])

But shouldn't this sum to 1? The entries above add up to 1.075 (four entries of 0.075 plus one of 0.775).
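A quick way to confirm the discrepancy without the library is to sum the printed values directly. The sketch below uses only the numbers from the output above. The helper names `distribution_total` and `is_normalised` are hypothetical and are not part of the repository.

```python
import math

# Values copied from the printed output: the distribution over next states
# for state 7 and action 0, laid out on the 5x5 grid.
row = [
    [0.0, 0.0,   0.075, 0.0,   0.0],
    [0.0, 0.075, 0.075, 0.775, 0.0],
    [0.0, 0.0,   0.075, 0.0,   0.0],
    [0.0, 0.0,   0.0,   0.0,   0.0],
    [0.0, 0.0,   0.0,   0.0,   0.0],
]


def distribution_total(grid):
    """Sum every entry of a 2-D grid of probabilities (hypothetical helper)."""
    return math.fsum(p for line in grid for p in line)


def is_normalised(grid, tol=1e-9):
    """Return True if the probabilities sum to 1 within `tol` (hypothetical helper)."""
    return math.isclose(distribution_total(grid), 1.0, abs_tol=tol)


total = distribution_total(row)
print(total)               # roughly 1.075, not 1
print(is_normalised(row))  # False for the values above
```

A valid transition distribution should pass `is_normalised` for every (state, action) pair, so the same check could be looped over all of them, assuming each slice can be converted to nested lists.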

MatthewJA commented 8 years ago

Ah, good catch! I'll have a shot at fixing this shortly.