Closed mohanr closed 7 years ago
Your code is one of the exercises from Chap. 1 ?
No, I just wrote it. It's not intent to be an answer to some exercise.
What is the result from 'run' or 'runs' that proves the learning is happening ?
To be honest I don't know. I didn't refer to the lisp code because I don't have any knowledge about lisp.
Did you create the example with a view to actually prove the RL algorithm learns ?
Yeah, there is a terminal interface. One can play with the program.
Your code is one of the exercises from Chap. 1 ? Mine is the port of the tictactoe lisp example to Haskell(https://github.com/mohanr/Reinforcement-Learning-An-Introduction-by-Richard-S.-Sutton-and-Andrew-G.-Barto/blob/master/tictactoe.hs)
What is the result from 'run' or 'runs' that proves the learning is happening ? My code returns a value between 40 and 50.
I mean this section.
My results are like this.
Did you create the example with a view to actually prove the RL algorithm learns ?