ying-wen / malib_deprecated

A Multi-agent Learning Framework
MIT License
61 stars, 16 forks

PPO Implementation #1

Open Phutoast opened 5 years ago

Phutoast commented 5 years ago

To better understand the repo's interface, we should try implementing PPO from scratch: https://arxiv.org/abs/1707.06347
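As a possible starting point, the core of PPO in the linked paper is the clipped surrogate objective. Below is a minimal sketch in plain Python that does not depend on this repo's interface. The function name `ppo_clip_loss`, its parameters, and the default `clip_eps=0.2` are assumptions for illustration, not part of malib.

```python
import math


def ppo_clip_loss(new_logp, old_logp, advantages, clip_eps=0.2):
    """Negative clipped surrogate objective, averaged over samples.

    Hypothetical helper for illustration only; not part of malib.
    new_logp / old_logp: log-probabilities of the taken actions under
    the current and the data-collecting policy.
    advantages: advantage estimates for the same samples.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        # Probability ratio r = pi_new(a|s) / pi_old(a|s).
        ratio = math.exp(lp_new - lp_old)
        # Clip the ratio to [1 - eps, 1 + eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Pessimistic bound: take the smaller of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    # Negate so that minimizing the loss maximizes the objective.
    return -total / len(advantages)
```

A full implementation would also need a value-function loss, an advantage estimator, and the sampling loop, all of which would have to be wired into whatever agent and trainer interfaces the repo exposes.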