Open saikrishna-1996 opened 2 years ago
We need R2D2 for the Hanabi agent, and the current RL implementations in cogment do not include recurrent networks. We should reproduce the results of R2D2: https://openreview.net/pdf?id=r1lyTjAqYX