Closed robjlyons closed 3 years ago
The agent trains on sequences of length 50. If the agent collects an episode that ends earlier, it will be skipped during data loading. It's nothing to worry about, unless the majority of your episode are shorter than 50.
May I ask what this means?
Is there something wrong with my env?
Thanks,