BlueDi / DeepDip

DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA
GNU General Public License v3.0

Improve DeepDip #5

Closed: BlueDi closed this issue 5 years ago

BlueDi commented 5 years ago

Improve DeepDip to its final version, v1.

This should be achieved by implementing known improvements to the DQN algorithm, such as Dueling Networks or Prioritized Experience Replay.
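
For reference, a minimal sketch of the aggregation step used by Dueling Networks, assuming the network has already produced a scalar state value and one advantage per action. The function name `dueling_q_values` and the example numbers are hypothetical and are not taken from the DeepDip code.

```python
def dueling_q_values(state_value, advantages):
    """Combine V(s) and A(s, a) into Q(s, a).

    Uses the mean-subtracted form Q = V + (A - mean(A)), so adding a
    constant to every advantage does not change the resulting Q-values.
    """
    mean_advantage = sum(advantages) / len(advantages)
    return [state_value + a - mean_advantage for a in advantages]


# Hypothetical outputs of the value and advantage streams for 3 actions.
q_values = dueling_q_values(2.0, [1.0, 0.0, -1.0])
print(q_values)
```

Subtracting the mean advantage is one common way to keep the value and advantage streams identifiable; other variants subtract the maximum instead.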

BlueDi commented 5 years ago

DQN was not used because of the incompatibilities explained in #4. ACKTR was implemented instead, and in this phase it was replaced with PPO.