marlbenchmark / off-policy

PyTorch implementations of popular off-policy multi-agent reinforcement learning algorithms, including QMix, VDN, MADDPG, and MATD3.
MIT License
395 stars, 67 forks

Run time #10

Open Bruce-Lan-ZY opened 2 years ago

Bruce-Lan-ZY commented 2 years ago

How long does it usually take to train QMix on MPE?

Bruce-Lan-ZY commented 2 years ago

That is, for a run of 10 million steps.