because of the use of yield,when we get the last state from history,we can not get the total_rewards and total_steps in previous code. We can get them together with reset the state,but it is the next episode.
So i
1、modify the code behind "if is_done: ..."
2、Considering that could have many simple in "history"
because of the use of yield,when we get the last state from history,we can not get the total_rewards and total_steps in previous code. We can get them together with reset the state,but it is the next episode.
So i 1、modify the code behind "if is_done: ..." 2、Considering that could have many simple in "history"
do this modify,please approve,thanks.