cogment / cogment-verse

Research platform for Human-in-the-loop learning (HILL) & Multi-Agent Reinforcement Learning (MARL)
https://cogment.ai/cogment_verse
Apache License 2.0
78 stars 14 forks source link

R2D2 #44

Open saikrishna-1996 opened 2 years ago

saikrishna-1996 commented 2 years ago

We need R2D2 for Hanabi agent and current implementations in cogment do not have recurrent networks in RL. We should reproduce the results of R2D2: https://openreview.net/pdf?id=r1lyTjAqYX