cogment / cogment-verse

Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL)
https://cogment.ai/cogment_verse
Apache License 2.0

150 make selfplay pong work #158

Closed lhnguyen102 closed 1 year ago

lhnguyen102 commented 1 year ago

Description

This PR implements Asynchronous Proximal Policy Optimization (APPO) for the PettingZoo Pong environment. The goal is to reach performance comparable to that reported in the literature.
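For readers less familiar with the PPO family, the following is a minimal sketch of the clipped surrogate loss that PPO-style methods, APPO included, are generally built on. It is not taken from this PR: the function name `clipped_surrogate_loss`, its parameters, and the default `clip_eps=0.2` are illustrative assumptions, and the actual implementation may add further terms (value loss, entropy bonus, off-policy corrections).

```python
import math


def clipped_surrogate_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate loss over a batch (a quantity to minimize).

    Hypothetical helper for illustration only; not part of cogment-verse.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_log_probs, old_log_probs, advantages):
        # Probability ratio between the current policy and the behaviour policy.
        ratio = math.exp(new_lp - old_lp)
        # Keep the ratio inside [1 - clip_eps, 1 + clip_eps].
        clipped_ratio = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Pessimistic (lower) bound of the unclipped and clipped objectives.
        total += min(ratio * adv, clipped_ratio * adv)
    # Negate because the surrogate objective is maximized.
    return -total / len(advantages)
```

With identical old and new log-probabilities the ratio is 1, so the loss reduces to the negative mean advantage. Clipping only takes effect once the new policy drifts more than `clip_eps` away from the policy that collected the data, which is what makes the objective tolerant of the slightly stale samples produced by asynchronous actors.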

Related Issue

#150

Change Made

Additional Notes

closes #150