fix bug in reinforce / fix bug in replay buffer / add train bash

hsvgbkhgbv / SQDDPG

This is a framework for research on multi-agent reinforcement learning. It also contains the implementation of the experiments in the paper "Shapley Q-value: A Local Reward Approach to Solve Global Reward Games".

114 stars, 42 forks

fix bug in reinforce / fix bug in replay buffer / add train bash #4

Closed: mikezhang95 closed this 5 years ago