Closed richardrl closed 5 years ago
I believe full_observations, observations, reward, terminals should have the same length
This extra append makes full_observation +1 over the other lists of the path
I believe full_observations, observations, reward, terminals should have the same length
This extra append makes full_observation +1 over the other lists of the path