The other two balls fail to learn the correct policy

starry-sky6688 / MADDPG

PyTorch implementation of the MARL algorithm MADDPG, which corresponds to the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments".


The other two balls fail to learn the correct policy #42

Closed Rhoslynnn closed 7 months ago

Rhoslynnn commented 7 months ago

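I replaced the ANN with an SNN and kept all interfaces consistent with your code, but the return stays flat at around 600. I eventually found that only one ball learns the correct policy, while the other two just spin in place. Could you tell me what might be causing this?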

starry-sky6688 commented 7 months ago

Since the only thing you changed is the neural network, the problem is most likely in the network.
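
One way to narrow this down, offered only as a sketch under assumptions: spike activations are non-differentiable, so SNNs are usually trained with a surrogate gradient, and an actor whose gradients vanish would keep repeating the same behaviour. The helper below flags agents whose recorded actor gradient norms are effectively zero. The function name `find_stalled_agents`, the agent names, the threshold, and the numbers are all hypothetical and do not come from this repository or from the issue; collecting the norms from the training loop is left to the reader.

```python
from statistics import mean


def find_stalled_agents(grad_norms, threshold=1e-6):
    """Return the agents whose mean actor gradient norm is at or below threshold.

    grad_norms: dict mapping a (hypothetical) agent name to a list of
    gradient norms, one per training update.
    """
    stalled = []
    for agent, norms in grad_norms.items():
        # Flag agents with no recorded updates or with vanishing gradients.
        if not norms or mean(norms) <= threshold:
            stalled.append(agent)
    return sorted(stalled)


# Made-up numbers for illustration only, not measurements from the issue.
example = {
    "agent_0": [0.31, 0.27, 0.22],
    "agent_1": [0.0, 0.0, 0.0],
    "agent_2": [1e-9, 0.0, 2e-9],
}
stalled = find_stalled_agents(example)
print(stalled)
```

If the two balls that spin in place show up in this list, the gradient path through the spiking layers would be a reasonable first place to look. If all three agents have healthy gradient norms, the cause is more likely elsewhere in the network, for example in how spikes are decoded into actions.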