CVHvn / Mario_PPO_RND

Playing Super Mario Bros with Proximal Policy Optimization (PPO) and Random Network Distillation (RND)
MIT License
2 stars 0 forks source link