This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled by ''Shapley Q-value: A Local Reward Approach to Solve Global Reward Games''.
114
stars
42
forks
source link
fix bug in reinforce / fix bug in replay buffer / add train bash #4