Thanks for the code. I have a question, what is the advantage of playing so many rounds (rounds = 10 in the code)? It doesn't seem to learn from previous rounds, as sometimes it is taking longer steps in later rounds.
BTW, how to print out the steps only for the round that has smallest steps to reach the goal?
Thanks for the code. I have a question, what is the advantage of playing so many rounds (rounds = 10 in the code)? It doesn't seem to learn from previous rounds, as sometimes it is taking longer steps in later rounds. BTW, how to print out the steps only for the round that has smallest steps to reach the goal?
thanks