Example for Continued PPO training after GAIL?

Khrylx / PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Learning (GAIL). Fast Fisher vector product TRPO.

MIT License


Example for Continued PPO training after GAIL? #15

Open signalprime opened 5 years ago

signalprime commented 5 years ago

Thank you so much for sharing these PyTorch implementations with the community. Do you have an example of continuing PPO training from a model saved by the GAIL process?
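For concreteness, here is a minimal sketch of the workflow being asked about: restore policy and value weights from a GAIL-style checkpoint, then run a clipped PPO update on them. It does not use this repository's classes or checkpoint format. `GaussianPolicy`, `make_value_net`, `ppo_step`, the in-memory `state_dict` checkpoint, and the random batch are all hypothetical stand-ins, and in real use the batch would come from environment rollouts scored with the true reward rather than the discriminator.

```python
import io

import torch
import torch.nn as nn

torch.manual_seed(0)


class GaussianPolicy(nn.Module):
    """Hypothetical stand-in policy (not the repo's class)."""

    def __init__(self, state_dim, action_dim):
        super().__init__()
        self.mean = nn.Sequential(
            nn.Linear(state_dim, 32), nn.Tanh(), nn.Linear(32, action_dim)
        )
        self.log_std = nn.Parameter(torch.zeros(action_dim))

    def log_prob(self, states, actions):
        dist = torch.distributions.Normal(self.mean(states), self.log_std.exp())
        return dist.log_prob(actions).sum(dim=-1)


def make_value_net(state_dim):
    """Hypothetical stand-in value network."""
    return nn.Sequential(nn.Linear(state_dim, 32), nn.Tanh(), nn.Linear(32, 1))


def ppo_step(policy, value, opt_p, opt_v, states, actions, returns,
             advantages, old_log_probs, clip=0.2):
    """One value-regression step and one clipped-surrogate policy step."""
    value_loss = (value(states).squeeze(-1) - returns).pow(2).mean()
    opt_v.zero_grad()
    value_loss.backward()
    opt_v.step()

    ratio = torch.exp(policy.log_prob(states, actions) - old_log_probs)
    clipped = torch.clamp(ratio, 1.0 - clip, 1.0 + clip)
    policy_loss = -torch.min(ratio * advantages, clipped * advantages).mean()
    opt_p.zero_grad()
    policy_loss.backward()
    opt_p.step()
    return policy_loss.item(), value_loss.item()


state_dim, action_dim = 3, 1

# Stand-in for the networks produced by GAIL training (assumption: only the
# policy and value weights are needed; the discriminator is dropped).
gail_policy = GaussianPolicy(state_dim, action_dim)
gail_value = make_value_net(state_dim)
buffer = io.BytesIO()  # in-memory substitute for a checkpoint file
torch.save({"policy": gail_policy.state_dict(),
            "value": gail_value.state_dict()}, buffer)
buffer.seek(0)

# Restore the weights into fresh networks and create new optimizers.
checkpoint = torch.load(buffer)
policy = GaussianPolicy(state_dim, action_dim)
value = make_value_net(state_dim)
policy.load_state_dict(checkpoint["policy"])
value.load_state_dict(checkpoint["value"])
opt_p = torch.optim.Adam(policy.parameters(), lr=3e-4)
opt_v = torch.optim.Adam(value.parameters(), lr=3e-4)

weights_match_before = all(
    torch.equal(a, b) for a, b in zip(policy.parameters(), gail_policy.parameters())
)

# Random placeholder batch; real training would collect it from the environment.
states = torch.randn(64, state_dim)
actions = torch.randn(64, action_dim)
returns = torch.randn(64)
advantages = torch.randn(64)
advantages = (advantages - advantages.mean()) / (advantages.std() + 1e-8)
with torch.no_grad():
    old_log_probs = policy.log_prob(states, actions)

policy_loss, value_loss = ppo_step(policy, value, opt_p, opt_v, states, actions,
                                   returns, advantages, old_log_probs)

weights_changed = any(
    not torch.equal(a, b) for a, b in zip(policy.parameters(), gail_policy.parameters())
)
```

The design point is that PPO only needs the policy and value networks, so continuing from GAIL comes down to loading those weights and switching the reward source. If a checkpoint also stores observation-normalization statistics, those would presumably need to be restored as well so the policy sees inputs on the same scale.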