vietnh1009 / Super-mario-bros-PPO-pytorch

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
MIT License
1.09k stars 206 forks source link

I think the next step you can use one model weights to play all world #22

Open zhaoyue3513247 opened 1 year ago