Debug/Benchmark Async PPO implementation

cogment / cogment-verse

Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL)

https://cogment.ai/cogment_verse

Apache License 2.0

80 stars 15 forks source link

Closed cloderic closed 1 year ago

lhnguyen102 commented 1 year ago

Here is the performance of APPO on hopper-v4. You also can find the hopper benchmark in [1]. reward_comp_hopper