Open tkchouaki opened 5 years ago
Yes, it seems you are right: the comment says "goes left" for east and "goes right" for west, which sounds wrong ;)
The good thing about RL is that the agent does not care about the names of the actions, so it will still learn to take appropriate actions. This still needs to be fixed, though; it will be done ASAP.
Thank you for spotting this.
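To illustrate why the agent is unaffected by action names, here is a minimal sketch on a hypothetical 4-cell corridor (not the repository's maze; `make_corridor`, `value_iteration` and `greedy_successors` are names made up for this example). It runs value iteration twice, once with each labelling of the two actions, and compares the resulting values and the cells the greedy policy moves to.

```python
def make_corridor(left_action, right_action, n_cells=4):
    """Hypothetical corridor MDP; the last cell is the rewarded goal."""
    next_state = [[0, 0] for _ in range(n_cells)]
    reward = [[0.0, 0.0] for _ in range(n_cells)]
    for s in range(n_cells):
        next_state[s][left_action] = max(s - 1, 0)
        next_state[s][right_action] = min(s + 1, n_cells - 1)
        for a in (0, 1):
            # Reward 1 for any move that ends in the goal cell.
            reward[s][a] = 1.0 if next_state[s][a] == n_cells - 1 else 0.0
    return next_state, reward


def value_iteration(next_state, reward, gamma=0.9, sweeps=200):
    """Plain synchronous value iteration for a deterministic MDP."""
    values = [0.0] * len(next_state)
    for _ in range(sweeps):
        values = [
            max(reward[s][a] + gamma * values[next_state[s][a]] for a in (0, 1))
            for s in range(len(next_state))
        ]
    return values


def greedy_successors(next_state, reward, values, gamma=0.9):
    """Cell reached from each state when acting greedily w.r.t. values."""
    result = []
    for s in range(len(next_state)):
        best = max((0, 1), key=lambda a: reward[s][a] + gamma * values[next_state[s][a]])
        result.append(next_state[s][best])
    return result


# Labelling A: action 0 moves left, action 1 moves right.
ns_a, r_a = make_corridor(left_action=0, right_action=1)
# Labelling B: the two labels are swapped.
ns_b, r_b = make_corridor(left_action=1, right_action=0)

values_a = value_iteration(ns_a, r_a)
values_b = value_iteration(ns_b, r_b)
moves_a = greedy_successors(ns_a, r_a, values_a)
moves_b = greedy_successors(ns_b, r_b, values_b)

print(values_a == values_b)  # same optimal values under both labellings
print(moves_a == moves_b)    # same physical moves, only the action index differs
```

The labels only matter to a human reading the comments or the plot, which is why the mismatch is a documentation and display problem rather than a learning problem.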
The maze_plotter should also be modified so that the arrows point in the correct direction.
No, to me the better fix is to swap east and west everywhere in the comments. These notions have no meaning from the program's point of view, so the program itself is not wrong in that respect.
I believe the East and West directions are inverted in the transition function. In fact, running this code (which asks the agent to go west) in the mdp.ipynb notebook gives the result below
I believe it is due to the initialization of the transition function in this portion of code
Swapping the directions handled by these two portions of code should solve the problem.
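As a rough illustration of the convention being asked for, the sketch below builds a deterministic transition table for a wall-free grid in which east increases the column index and west decreases it. The action indices, the row-major state numbering and the `build_transitions` helper are all assumptions made for this example; the project's actual transition function may be organised differently.

```python
# Hypothetical action indices; the real project may number them differently.
NORTH, SOUTH, EAST, WEST = 0, 1, 2, 3

# (row, column) offsets, with column indices increasing towards the east.
MOVES = {
    NORTH: (-1, 0),
    SOUTH: (1, 0),
    EAST: (0, 1),   # east: one column to the right
    WEST: (0, -1),  # west: one column to the left
}


def build_transitions(height, width):
    """Return next_state[state][action] for a wall-free grid.

    States are numbered row by row: state = row * width + col.
    """
    next_state = []
    for row in range(height):
        for col in range(width):
            targets = []
            for action in (NORTH, SOUTH, EAST, WEST):
                d_row, d_col = MOVES[action]
                new_row, new_col = row + d_row, col + d_col
                if 0 <= new_row < height and 0 <= new_col < width:
                    targets.append(new_row * width + new_col)
                else:
                    # Moving into the border leaves the agent where it is.
                    targets.append(row * width + col)
            next_state.append(targets)
    return next_state


table = build_transitions(2, 3)
# State 1 is (row 0, col 1): west leads to state 0, east to state 2.
print(table[1][WEST], table[1][EAST])
```

Keeping the offsets in a single table like `MOVES` means the comments, the transition function and any plotting code can all read the same definition of each direction, so they cannot drift apart.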