uvipen / Super-mario-bros-PPO-pytorch

Proximal Policy Optimization (PPO) algorithm for Super Mario Bros
MIT License
1.07k stars 201 forks source link

I think the next step you can use one model weights to play all world #22

Open zhaoyue3513247 opened 1 year ago