Open zplizzi opened 2 years ago
https://github.com/zplizzi/pytorch-ppo/blob/master/main.py#L132
yeah, this is definitely backwards. should go through all batches each epoch.
https://github.com/zplizzi/pytorch-ppo/blob/master/main.py#L132